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Jurgen Ehlers 



In the 1950s the mathematical department of Hamburg University, with its 
stars Artin, Blaschke, Collatz, Kahler, Peterson, Sperner and Witt had a 
strong drawing power for Jiirgen Ehlers, student of mathematics and physics. 
Since he had impressed his teachers he could well have embarked on a dis- 
tinguished career in mathematics had it not been for Pascual Jordan and - I 
suspect - Hermann Weyl’s Space-Time-Matter. 

Jordan had just published his book “Schwerkraft und Weltall” which was 
a text on Einstein’s theory of gravitation, developing his theory of a variable 
gravitational “constant”. Only the rudiments of this theory had been for- 
mulated and Jordan, overburdened with countless extraneous commitments, 
was eager to find collaborators to develop his theory. This opportunity to 
break new ground in physics enticed Jiirgen Ehlers and Wolfgang Kundt to 
help Jordan with his problems, and their work was acknowledged in the 1955 
second edition of Jordan’s book. 

It didn’t take Jiirgen, who always was a systematic thinker, long to re- 
alize that not only Jordan’s generalization but also Einstein’s theory itself 
needed a lot more work. This impression was well described by Kurt Goedel 
in 1955 in a letter to Carl Seelig: “My own work in relativity theory refers 
to the pure gravitational theory of 1916 of which I believe that it was left by 
Einstein himself and the whole contemporary generation of physicists as a 
torso - and in every respect, physically, mathematically, and its applications 
to cosmology” . 

When asked by Seelig to elaborate , Goedel added: “Concerning the com- 
pletion of gravitational theory of which I wrote in my last letter I do not 
mean a completion in the sense that the theory would cover a larger domain 
of phenomena (Tatsachenbereich), but a mathematical analysis of the equa- 
tions that would make it possible to attempt their solution systematically 
and to find their general properties. Until now one does not even know the 
analogs of the fundamental integral theorems of Newtonian theory which, 
in my opinion, have to exist without fail. Since such integral theorems and 
other mathematical lemmas would have a physical meaning, the physical un- 
derstanding of the theory would be enhanced. On the other hand, a closer 
analysis of the physical content of the theory could lead to such mathematical 
theorems” . 



VI Jurgen Ehlers 



Such a view of Einstein’s theory was also reflected in the talks and dis- 
cussions of the “Jordan Seminar”. This was a weekly meeting of Jordan’s 
coworkers in the Physics Department of Hamburg University to discuss Jor- 
dan’s theory of a variable gravitational scalar. However, under Jurgen’s lead- 
ership, the structure and interpretation of Einstein’s original theory became 
the principal theme of nearly all talks. Jordan, who found little time to con- 
tribute actively to his theory, reluctantly went along with this change of topic. 
Through grants from the US Air Force and other sources he provided the lo- 
gistic support for his research group. For publication of the lengthy research 
papers on Einstein’s theory of gravitation by Ehlers, Kundt, Ozsvath, Sachs 
and Triimper, he made the proceedings of the Akademie der Wissenschaften 
und der Literatur in Mainz available. Jordan appeared often as coauthor, 
but I doubt whether he contributed much more than suggestions in style, 
like never to start a sentence with a formula. Some results were also written 
up as reports for the Air Force and became known as the Hamburg Bible. 

It was a principal concern in Jurgen’s contributions to Einstein’s theory 
to clarify the mathematics, separate proof from conjecture and insist on in- 
variance as well as elegance. This clear and terse style, which always kept 
physical interpretation in mind, appeared already in his Hamburg papers. 
His work in relativity resulted not only in books, published papers, super- 
vised theses, critical remarks in discussions and suggestions for future work. 
By establishing the “Albert-Einstein-Institut” Jurgen designed a unique in- 
ternational center for research in relativity. As the founding director of this 
“Max-Planck-Institut fiir Gravitationsphysik” in Brandenburg, he has led it 
to instant success. Through his leadership, research on Einstein’s theory in 
Germany is flourishing again and his work and style has set a standard for a 
whole generation of researchers. 



Engelbert Schiicking 



Preface 



The contributions in this book are dedicated to Jurgen Ehlers on the occasion 
of his 70th birthday. I have tried to find topics which were and are near to 
Jurgen’s interests and scientific activities. I hope that the book - even in 
the era of electronic publishing ~ will serve for some time as a review of the 
themes treated; a source from which, for example, a PhD student could learn 
certain things thoroughly. In initiating the project of the book, the model I 
had in mind was the “Witten book” . 

Early in his career Jurgen Ehlers worked on exact solutions, and demon- 
strated how one goes about characterizing exact solutions invariantly and 
searching for their intrinsic geometrical properties. So, it seems appropriate 
to begin the book with the article by J. Bicak: “Selected Solutions of Ein- 
stein’s Field Equations: Their Role in General Relativity and Astrophysics.” 
Certainly not all of the large number of known exact solutions are of equal 
weight; this article describes the most important ones and explains their role 
for the development and understanding of Einstein’s theory of gravity. 

The second contribution is the article by H. Friedrich and A. Rendall: 
“The Cauchy Problem for the Einstein Equations” . It contains a careful ex- 
position of the local theory, including the delicate gauge questions and a 
discussion of various ways of writing the equations as hyperbolic systems. 
Furthermore, it becomes clear that an understanding of the Cauchy problem 
really gives new insight into properties of the equations and the solutions and 
not just “uniqueness and existence”. 

“Post-Newtonian Gravitational Radiation” is the title of the article by 
L. Blanchet. It deals with a topic Jurgen has contributed to and thought 
about deeply. However, these matters have developed in such a way that 
presently only a small number of experts understand all the technical details 
and subtleties. Hopefully, this present contribution will help us gain some 
understanding of certain aspects of post-Newtonian approximations. 

The fourth contribution, “Duality and Hidden Symmetries in Gravita- 
tional Theories”, by D. Maison, outlines how far one of Jiirgen’s creations, 
the “Ehlers transformation” has evolved. From a “trick” to produce new so- 
lutions from known ones, the presence of such transformations in the space of 
solutions is now seen as a structural property of various gravitational theories, 
which at present attract a lot of attention. 



VIII Preface 



The contribution, by R. Beig and B. Schmidt, “Time-Independent Grav- 
itational Fields” collects and describes what is known about global proper- 
ties of time-independent spacetimes. It contains, in particular, a fairly self- 
contained description of the multipole expansion at infinity. 

V. Perlick has written on “Gravitational Tensing from a Geometric View- 
point” . In the last ten years, lensing has become a fascinating new part of 
observational astrophysics. However, there are still important and interesting 
conceptual and mathematical questions when one tries to compare practical 
astrophysical applications with their mathematical modelling in Einstein’s 
theory of gravity. Some of those issues are treated in this contribution. 

Obviously, there are some subjects missing, for which I was not able to find 
a contribution. What I regret most is that there is no article on cosmology, 
a field in which Jiirgen has always been very interested. 

An intruiging thought about the book is that Juergen would have read all 
these contributions before publication and no doubt improved them by his 
constructive criticism. For a short while I had in mind to ask Jurgen to do 
just this, but finally I decided that this would be too much of a burden for a 
birthday present. 

Finally, I would like to thank the authors, friends and colleagues who have 
helped me and have given valuable advice. 

Bernd Schmidt 
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Selected Solutions of Einstein’s Field 
Equations: Their Role in General Relativity 
and Astrophysics 

Jin Bicak 

Institute of Theoretical Physics, 

Charles University, Prague 



1 Introduction and a Few Excursions 

The primary purpose of all physical theory is rooted in reality, and most rela- 
tivists pretend to be physicists. We may often be members of departments of 
mathematics and our work oriented towards the mathematical aspects of Ein- 
stein’s theory, but even those of us who hold a permanent position on “scri” , 
are primarily looking there for gravitational waves. Of course, the builder of 
this theory and its field equations was the physicist. Jurgen Ehlers has always 
been very much interested in the conceptual and axiomatic foundations of 
physical theories and their rigorous, mathematically elegant formulation; but 
he has also developed and emphasized the importance of such areas of rela- 
tivity as kinetic theory, the mechanics of continuous media, thermodynamics 
and, more recently, gravitational lensing. Feynman expressed his view on the 
relation of physics to mathematics as follows [1]: 

“The physicist is always interested in the special case; he is never inter- 
ested in the general case. He is talking about something; he is not talking 
abstractly about anything. He wants to discuss the gravity law in three di- 
mensions; he never wants the arbitrary force case in n dimensions. So a certain 
amount of reducing is necessary, because the mathematicians have prepared 
these things for a wide range of problems. This is very useful, and later on it 
always turns out that the poor physicist has to come back and say, ‘Excuse 
me, when you wanted to tell me about four dimensions...’ ” Of course, this 
is Feynman, and from 1965... 

However, physicists are still rightly impressed by special explicit formulae. 
Explicit solutions enable us to discriminate more easily between a “physical” 
and “pathological” feature. Where are there singularities? What is their char- 
acter? How do test particles and fields behave in given background space- 
times? What are their global structures? Is a solution stable and, in some 
sense, generic? Clearly, such questions have been asked not only within gen- 
eral relativity. 

By studying a special explicit solution one acquires an intuition which, 
in turn, stimulates further questions relevant to more general situations. 
Consider, for example, charged black holes as described by the Reissner- 
Nordstrom solution. We have learned that in their interior a Cauchy horizon 

B.G. Schmidt (Ed.): LNP 540, pp. 1-126, 1999. 
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exists and that the singularities are timelike. We shall discuss this in greater 
detail in Sect. 3.1. The singularities can be seen by, and thus exert an influence 
on, an observer travelling in their neighborhood. However, will this violation 
of the (strong) cosmic censorship persist when the black hole is perturbed 
by weak (“linear”) or even strong (“nonlinear”) perturbations? We shall see 
that, remarkably, this question can also be studied by explicit exact special 
model solutions. Still more surprisingly, perhaps, a similar question can be 
addressed and analyzed by means of explicit solutions describing completely 
diverse situations - the collisions of plane waves. As we shall note in Sect. 
8.3, such collisions may develop Cauchy horizons and subsequent timelike sin- 
gularities. The theory of black holes and the theory of colliding waves have 
intriguing structural similarities which, first of all, stem from the circum- 
stance that in both cases there exist two symmetries, i.e. two Killing fields. 
What, however, about more general situations? This is a natural question 
inspired by the explicit solutions. Then “the poor physicists have to come 
back” to a mathematician, or today alternatively, to a numerical relativist, 
and hope that somehow they will firmly learn whether the cosmic censor- 
ship is the “truth”, or that it has been a very inspirational, but in general 
false conjecture. However, even after the formulation of a conjecture about 
a general situation inspired by particular exact solutions, newly discovered 
exact solutions can play an important role in verifying, clarifying, modifying, 
or ruling out the conjecture. And also “old” solutions may turn out to act 
as asymptotic states of general classes of models, and so become still more 
significant. 

Exact explicit solutions have played a crucial role in the development of 
many areas of physics and astrophysics. Later on in this Introduction we 
will take note of some general features which are specific to the solutions of 
Einstein’s equations. Before that, however, for illustration and comparison 
we shall indicate briefly with a few examples what influence exact explicit 
solutions have had in other physical theories. Our next introductory excur- 
sion, in Sect. 1.2, describes in some detail the (especially early) history of 
Einstein’s route to the gravitational field equations for which his short stay 
in Prague was of great significance. The role of Ernst Mach (who spent 28 
years in Prague before Einstein) in the construction of the first modern cos- 
mological model, the Einstein static universe, is also touched upon. Section 
1.3 is devoted to a few remarks on some old and new impacts of the other 
simplest “cosmological” solutions of Einstein’s equations - the Minkowski, 
the de Sitter, and the anti de Sitter spacetimes. Some specific features of 
solutions in Einstein’s theory, such as the observability and interpretation 
of metrics, the role of general covariance, the problem of the equivalence of 
two metrics, and of geometrical characterization of solutions are mentioned in 
Sect. 1.4. Finally, in the last (sub)sections of the “Introduction” we give some 
reasons why we consider our choice of solutions to be “a natural selection”, 
and we briefly outline the main body of the article. 
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1.1 A Word on the Role of Explicit Solutions 
in Other Parts of Physics and Astrophysics 

Even in a linear theory like Maxwell’s electrodynamics one needs a good 
sample, a useful kit, of exact fields like the homogeneous field, the Coulomb 
field, the dipole, the quadrupole and other simple solutions, in order to gain 
a physical intuition and understanding of the theory. Similarly, of course, 
with the linearized theory of gravity. Going over to the Schrodinger equa- 
tion of standard quantum mechanics, again a linear theory, consider what we 
have learned from simple, explicitly soluble problems like the linear and the 
three-dimensional harmonic oscillator, or particles in potential wells of var- 
ious shapes. We have acquired, for example, a transparent insight into such 
basic quantum phenomena as the existence of minimum energy states whose 
energy is not zero, and their associated wave functions which have a certain 
spatial extent, in contrast to classical mechanics. The three-dimensional prob- 
lems have taught us, among other things, about the degeneracy of the energy 
levels. The case of the harmonic oscillator is, of course, very exceptional since 
Hamiltonians of the same type appear in all problems involving quantized os- 
cillations. One encounters them in quantum electrodynamics, quantum field 
theory, and likewise in the theory of molecular and crystalline vibrations. It is 
thus perhaps not so surprising that the Hamiltonian and the wave functions 
of the harmonic oscillator arise even in the minisuperspace models associated 
with the Hartle-Hawking no-boundary proposal for the wave function of the 
universe [2], and in the minisuperspace model of homogeneous spherically 
symmetric dust filled universes [3]. 

In nonlinear problems explicit solutions play still a greater role since to 
gain an intuition of nonlinear phenomena is hard. Landau and Lifshitz in 
their Fluid Mechanics (Volume 6 of their course) devote a whole section to 
the exact solutions of the nonlinear Navier-Stokes equations for a viscous 
fluid (including Landau’s own solution for a jet emerging from the end of a 
narrow tube into an infinite space filled with fluid). 

Although Poisson’s equation for the gravitational potential in the classical 
theory of gravity is linear, the combined system of equations describing both 
the field and its fluid sources (not rigid bodies, these are simple!) character- 
ized by Euler’s equations and an equation of state are nonlinear. In classical 
astrophysical fluid dynamics perhaps the most distinct and fortunate example 
of the role of explicit solutions is given by the exact descriptions of ellipsoidal, 
uniform density masses of self-gravitating fluids. These “ellipsoidal figures of 
equilibrium” [4] include the familiar Maclaurin spheroids and triaxial Jacobi 
ellipsoids, which are characterized by rigid rotation, and a wider class dis- 
covered by Dedekind and Riemann, in which a motion of uniform vorticity 
exists, even in a frame in which the ellipsoidal surface is at rest. The solutions 
representing the rotating ellipsoids did not only play an inspirational role in 
developing basic concepts of the theory of rigidly rotating stars, but quite un- 
expectedly in the study of inviscid, differentially rotating polytropes. These 
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closely resemble Maclaurin spheroids, although they do not maintain rigid 
rotation. As noted in the well-known monograph on rotating stars [5], “the 
classical work on uniformly rotating, homogeneous spheroids has a range of 
validity much greater than was usually anticipated” . It also influenced galac- 
tic dynamics [6]: the existence of Jacobi ellipsoids suggested that a rapidly 
rotating galaxy may not remain axisymmetric, and the Riemann ellipsoids 
demonstrated that there is a distinction between the rate at which the matter 
in a triaxial rotating body streams and the rate at which the figure of the 
body rotates. Since rotating incompressible ellipsoids adequately illustrate 
the general feature of rotating axisymmetric bodies, they are also used in 
the studies of double stars whose components are close to each other. The 
disturbances caused by a neighbouring component are treated as first order 
perturbations. Relativistic effects on the rotating incompressible ellipsoids 
have been investigated in the post-Newtonian approximation by various au- 
thors, recently with a motivation to understand the coalescence of binary 
neutron stars near their innermost stable circular orbit (see [7] for the latest 
work and a number of references). 

As for the last subject, which has a more direct connection with exact 
explicit solutions of Einstein’s equations, we want to say a few words about 
integrable systems and their soliton solutions. Soliton theory has been one 
of the most interesting developments in the past decades both in physics 
and mathematics, and gravity has played a role both in its birth and recent 
developments. It has been known from the end of the last century that the 
celebrated Korteweg-de Vries nonlinear evolution equation, which governs 
one dimensional surface gravity waves propagating in a shallow channel of 
water, admits solitary wave solutions. However, it was not until Zabusky and 
Kruskal (the Kruskal of Sect. 2.4 below) did extensive numerical studies of 
this equation in 1965 that the remarkable properties of the solitary waves 
were discovered: the nonlinear solitary waves, named solitons by Zabusky 
and Kruskal, can interact and then continue, preserving their shapes and 
velocities. This discovery has stimulated extensive studies of other nonlin- 
ear equations, the inverse scattering methods of their solution, the proof 
of the existence of an infinite number of conservation laws associated with 
such equations, and the construction of explicit solutions (see [8] for a re- 
cent comprehensive treatise). Various other nonlinear equations, similar to 
the sine-Gordon equation or the nonlinear Schrodinger equation, arising for 
example in plasma physics, solid state physics, and nonlinear optics, have 
also been successfully tackled by these methods. At the end of the 1970s sev- 
eral authors discovered that Einstein’s vacuum equations for axisymmetric 
stationary systems can be solved by means of the inverse scattering meth- 
ods, and it soon became clear that one can employ them also in situations 
when both Killing vectors are spacelike (producing, for example, soliton-type 
cosmological gravitational waves). Dieter Maison, one of the pioneers in ap- 
plying these techniques in general relativity, describes the subject thoroughly 
in this volume. We shall briefly meet the soliton methods when we discuss 
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the uniformly rotating disk solution of Neugebauer and Meinel (Sect. 6.3), 
colliding plane waves (Sect. 8.3), and inhomogeneous cosmological models 
(Sect. 12.2). Our aim, however, is to understand the meaning of solutions, 
rather than generation techniques of finding them. From this viewpoint it 
is perhaps first worth noting the interplay between numerical and analytic 
studies of the soliton solutions - hopefully, a good example of an interaction 
for numerical and mathematical relativists. However, the explicit solutions 
of integrable models have played important roles in various other contexts. 
The most interesting mMZtz-dimensional integrable equations are the four- 
dimensional self-dual Yang-Mills equations arising in field theory. Their so- 
lutions, discovered by R. Ward using twistor theory, on one hand stimulated 
Donaldson’s most remarkable work on inequivalent differential structures on 
four-manifolds. On the other hand. Ward indicated that many of the known 
integrable systems can be obtained by dimensional reduction from the self- 
dual Yang-Mills equations. Very recently this view has been substantiated 
in the monograph by Mason and Woodhouse [9]. The words by which these 
authors finely express the significance of exact solutions in integrable systems 
can be equally well used for solutions of Einstein’s equations: “they combine 
tractability with nonlinearity, so they make it possible to explore nonlinear 
phenomena while working with explicit solutions” . ^ 

1.2 Einstein’s Field Equations 

Since Jurgen Ehlers has always been, among other things, interested in the 
history of science, he will hopefully tolerate a few remarks on the early his- 
tory of Einstein’s equations to which not much attention has been paid in 
the literature. It was during his stay in Prague in 1911 and 1912 that Ein- 
stein’s intensive interest in quantum theory diminished, and his systematic 
effort in constructing a relativistic theory of gravitation began. In his first 
“Prague theory of gravity” he assumed that gravity can be described by a 
single function - the local velocity of light. This assumption led to insur- 
mountable difficulties. However, Einstein learned much in Prague on his way 
to general relativity [11]: he understood the local significance of the principle 
of equivalence; he realized that the equations describing the gravitational field 
must be nonlinear and have a form invariant with respect to a larger group 

^ In 1998, in the discussion after his Prague lecture on the present role of physics 
in mathematics, Prof. Michael Atiyah expressed a similar view that even with 
more powerful supercomputers and with a growing body of general mathematical 
results on the existence and uniqueness of solutions of differential equations, the 
exact, explicit solutions of nonlinear equations will not cease to play a significant 
role. (As it is well known. Sir Michael Atiyah has made fundamental contributions 
to various branches of mathematics and mathematical physics, among others, to 
the theory of solitons, instantons, and to the twistor theory of Sir Roger Penrose, 
with whom he has been interacting “under the same roof” in Oxford for 17 years 
[ 10 ].) 
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of transformations than the Lorentz group; and he found that “spacetime 
coordinates lose their simple physical meaning'', i.e. they do not determine 
directly the distances between spacetime points.^ In his “Autobiographical 
Notes” Einstein says: “Why were seven years ... required for the construction 
of general relativity? The main reason lies in the fact that it is not easy to 
free oneself from the idea that coordinates must have an immediate metrical 
meaning”... Either from Georg Pick while still in Prague, or from Marcel 
Grossmann during the autumn of 1912 after his return to Zurich (cf. [11]), 
Einstein learned that an appropriate mathematical formalism for his new the- 
ory of gravity was available in the work of Riemann, Ricci, and Levi-Givita. 
Several months after his departure from Prague and his collaboration with 
Grossmann, Einstein had general relativity almost in hand. Their work [13] 
was already based on the generally invariant line element 

= g^vdx^dx'' (I) 

in which the spacetime metric tensor /i, v,p = Q, 1, 2, 3, plays a dual 

role: on the one hand it determines the spacetime geometry, on the other it 
represents the (ten components of the) gravitational potential and is thus a 
dynamical variable. The disparity between geometry and physics, criticized 
notably by Ernst Mach,^ had thus been removed. When searching for the field 
equations for the metric tensor, Einstein and Grossmann had already real- 
ized that a natural candidate for generally covariant field equations would 
be the equations relating - in present-day terminology - the Ricci tensor 
and the energy-momentum tensor of matter. However, they erroneously con- 
cluded that such equations would not yield the Poisson equation of Newton’s 
theory of gravitation as a first approximation for weak gravitational fields 
(see both §5 in the “Physical part” in [13] written by Einstein and §4, be- 
low equation (46), in the “Mathematical part” by M. Grossmann). Einstein 
then rejected the general covariance. In a subsequent paper with Grossmann 
[14], they supported this mis-step by a well-known “hole” meta-argument and 
obtained (in today’s terminology) four gauge conditions such that the field 
equations were covariant only with respect to transformations of coordinates 
permitted by the gauge conditions. We refer to, for example, [15] for more 
detailed information on the further developments leading to the final version 
of the field equations. Let us only summarize that in late 1915 Einstein first 
readopted the generally covariant field equations from 1913, in which the 
Ricci tensor was, up to the gravitational coupling constant, equal to the 
energy-momentum tensor T^i, (paper submitted to the Prussian Academy on 

^ At that time Einstein’s view on the fnture theory of gravity are best summarized 
in his reply to M. Abraham [12], written just before departure from Prague. 

® Mach spent 28 years as Professor of Experimental Physics in Prague, until 1895, 
when he took the History and Theory of Inductive Natural Sciences chair in 
Vienna. 
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November 4). From his vacuum field equations 

Rfj.li = 0 , 



(II) 



where depends noirlinearly on and its first derivatives, and linearly 
on its second derivatives, he was able to explain the anomalous part of the 
perihelion precession of Mercury - in the note presented to the Academy 
on November 18. Aird finally, iir the paper [16] submitted on November 25 
(published oir December 2, 1915), the final version of the gravitational field 
equations, or Einstein’s field equations appeared:^ 



where the scalar curvature R = g^’' Rfjv Newton’s gravitational constant G 
and the velocity of light c are the (only) fmrdamental constants appearing 
in the theory. If irot stated otherwise, in this article we use the geometrized 
units in which G = c = 1, and the same conventions as in [18] and [19]. 

Now it is well kirown that Einsteiir further geireralized his field equations 
by adding a cosmological term oir the left side of the field equations 

(III). The cosmological constant A appeared first in Einstein’s work “Cos- 
mological coirsiderations iir the Geireral Theory of Relativity” [20] submitted 
on February 8, 1917 and published on February 15, 1917, which coirtaiired 
the closed static model of the Universe (the Einstein static universe) - an 
exact solution of equations (III) with A > 0 and an energy-momentum ten- 
sor of incoherent matter ( “dust” ) . This solution marked the birth of modern 
cosmology. 

We do not wish to embark upon the question of the role that Mach’s prin- 
ciple played iir Einstein’s thinking when constructing general relativity, or 
upon the intriguing issues relating to aspects of Mach’s principle in present- 
day relativity and cosmology^ - a problem which in any event would far 
exceed the scope of this article. Although it would not be inappropriate to 

^ David Hilbert submitted his paper on these field equations five days before Ein- 
stein, though it was published only on March 31, 1916. Recent analysis [17] 
of archival materials has revealed that Hilbert made significant changes in the 
proofs. The originally submitted version of his paper contained the theory which 
is not generally covariant, and the paper did not include equations (HI). 

® It was primarily Einstein’s recognition of the role of Mach’s ideas in his route 
towards general relativity, and in his christening them by the name “Mach’s 
principle” (though Schlick used this term in a vague sense three years before 
Einstein), that makes Mach’s Principle influential even today. After the 1988 
Prague conference on Ernst Mach and his influence on the development of physics 
[21], the 1993 conference devoted exclusively to Mach’s principle was held in 
Tubingen, from which a remarkably thorough volume was prepared [22], covering 
all aspects of Mach’s principle and recording carefully all discussion. The clarity 
of ideas and insights of Jiirgen Ehlers contributed much to both conferences and 
their proceedings. For a brief more recent survey of various aspects of Mach’s 
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include it here since exact solutions (such as Godel’s universe or Ozsvath’s 
and Schiicking’s closed model) have played a prominent role in this context. 
However, it should be at least stated that Einstein originally invented the idea 
of a closed space in order to eliminate boundary conditions at spatial infinity. 
The boundary conditions “fiat at infinity” bring with them an inertial frame 
unrelated to the mass-energy content of the space, and Einstein, in accordance 
with Mach’s views, believed that merely mass-energy can influence inertia. 
Field equations (III) are not inconsistent with this idea, but they admit as 
the simplest solution an empty fiat Minkowski space {T^^, = 0, = 

diag (—1, -1-1, -1-1, -1-1)), so some restrictive boundary conditions are essential 
if the idea is to be maintained. Hence, Einstein introduced the cosmological 
constant H, hoping that with this space will always be closed, and the bound- 
ary conditions eliminated. But it was also in 1917 when de Sitter discovered 
the solution [25] of the vacuum field equations (II) with added cosmological 
term {A > 0) which demonstrated that a nonvanishing A does not necessarily 
imply a nonvanishing mass-energy content of the universe. 

1.3 “Just So” Notes on the Simplest Solutions: The Minkowski, 
de Sitter, and Anti-de Sitter Spacetimes 

Our brief intermezzo on the cosmological constant brought up three explicit 
simple exact solutions of Einstein’s field equations - the Minkowski, Einstein, 
and de Sitter spacetimes. To these also belongs the anti de Sitter spacetime, 
corresponding to a negative A. The de Sitter spacetime has the topology x 

(with corresponding to the time) and is best represented geometrically 
as the 4-dimensional hyperboloid — = (3/H) in 5- 
dimensional fiat space with metric ds^ = —dv^ + dw^ + dx^ + dy^ + dz^. The 
anti de Sitter spacetime has the topology x R?, and can be visualized as 
the 4-dimensional hyperboloid —U'^ — V'^ + X'^ + Y'^ + Z'^ = (— 3/A), H < 0, in 
fiat 5-dimensional space with metric ds"^ = —dU'^ — dV'^+dX'^+dY'^+dZ'^. As 
is usual (cf. e.g. [26,27]), we mean by “anti de Sitter spacetime” the universal 
covering space which contains no closed timelike lines; this is obtained by 
unwrapping the circle S^. 

These spacetimes will not be discussed in the following sections. Ocas- 
sionally, for instance, in Sects. 5 and 10, we shall consider spacetimes which 
become asymptotically de Sitter. However, since these solutions have played 
a crucial role in many issues in general relativity and cosmology, and most 
recently, they have become important prerequisites on the stage of the theo- 
retical physics of the “new age” , including string theory and string cosmology, 
we shall make a few comments on these solutions here, and give some refer- 
ences to recent literature. 

principle in general relativity, see the introductory section in the work [23], in 
which Mach’s principle is analyzed in the context of perturbed Robertson- Walker 
universes. Most recently, Mach’s principle seems to enter even into M theory [24] . 
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The basic geometrical properties of these spaces are analyzed in the Bat- 
telle Recoirtres lectures by Penrose [27], aird in the monograph by Hawking 
and Ellis [26], where also references to older literature can be fouird. The im- 
portant role of the de Sitter solution in the theory of the expairdiirg uiriverse 
is finely described in the book by Peebles [28], and in much greater detail 
in the proceedings of the Bologna 1988 meeting on the history of modern 
cosmology [29]. 

The Minkowski, de Sitter aird anti de Sitter spacetimes are the simplest so- 
lutions in the sense that their metrics are of constant (zero, positive, and ireg- 
ative) curvature. They admit the same irumber (ten) of iirdependeirt Killing 
vectors, but the iirterpretations of correspoirdiirg symmetries differ for each 
spacetime. Together with the Einstein static universe, they all are confor- 
mally flat, aird can be represented as portions of the Einstein static universe 
[26,27]. However, their conformal structure is globally different. In Minkowski 
spacetime one can go to infinity along timelike geodesics and arrive to the 
future (or past) timelike infinity (or i~); along null geodesics one reaches 
the future (past) null infinity and spacelike geodesics lead to spa- 

tial infinity i^. Minkowski spacetime can be compactified and mapped onto 
a finite region by an appropriate conformal rescaling of the metric. One thus 
obtains the well-known Penrose diagram in which the three types of infini- 
ties are mapped onto the boundaries of the compactified spacetime ~ see for 
example the boundaries on the “right side” in the Penrose diagram of the 
Schwarzschild — Kruskal spacetime in Fig. 3, Sect. 2.4, or the Penrose com- 
pactified diagram of boost-rotation symmetric spacetimes in Fig. 13, Sect. 11. 
(The details of the conformal rescaling of the metric and resulting diagrams 
are given in [26,27] and in standard textbooks, for example [18,19,30].) In 
the de Sitter spacetime there are only past and future conformal infiirities 
J~ , both being spacclikc (cf. the Penrose diagram of the “cosmological” 
Robinsoir-Traritman solutions iir Fig. 11, Sect. 10); the coirformal infinity in 
anti de Sitter spacetime is timelike. 

These three spacetimes of constant curvature offer many basic insights 
which have played a most important role elsewhere in relativity. To give just 
a few examples (see e.g. [26,27]): both the particle (cosmological) horizons 
and the eveirt horizons for geodesic observers are well illustrated iir the de 
Sitter spacetime; the Cauchy horizoirs in the anti de Sitter space; and the 
simplest acceleration horizons in Miirkowski space (hypersurfaces t^ = in 
Fig. 12, Sect. 11). With the de Sitter spacetime oire learns (by considering 
differeirt cuts through the 4-dimensional hyperboloid) that the concept of 
an “open” or “closed” universe depends upon the choice of a spacelike slice 
through the spacetime. There is perhaps no simpler way to understand that 
Einstein’s field equations are of local nature, and that the spacetime topology 
is thus not given a priori, than by considering the following construction 
in Minkowski spacetime. Take the region given in the usual coordinates by 
|2:| < Ijjyl ^ ^ remove the rest and identify pairs of boundary 

points of the form (t, l,y, z) and (t, —l,y, z), and similarly for y aird z. In 
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this way the spatial sections are identified to obtain a 3-torus - a fiat but 
closed manifold.® 

The spacetimes of constant curvature have been resurrected as basic are- 
nas of new physical theories since their first appearance. After the role of the 
de Sitter universe decreased with the refutation of the steady-state cosmol- 
ogy, it has inflated again enormously in connection with the theory of early 
quasi-exponential phase of expansion of the universe, due to the false- vacuum 
state of a hypothetical scalar (infiaton) field(s) (see e.g. [28]). We shall men- 
tion the de Sitter space as the asymptotic state of cosmological models with a 
nonvanishing A (so verifying the “cosmic no-hair conjecture”) in Sect. 10 on 
Robinson-Trautman spacetimes. Motivated by its importance in inflationary 
cosmologies, several new useful papers reviewing the properties of de Sitter 
spacetime have appeared [33,34]; they also contain many references to older 
literature. For the most recent work on the quantum structure of de Sitter 
space, see [35]. 

In the last two years, anti de Sitter spacetime has come to the fore in light 
of Maldacena’s conjecture [36] relating string theory in (asymptotically) anti 
de Sitter space to a non-gravitational conformal field theory on the boundary 
at spatial infinity, which is timelike as mentioned above (see, e.g. [37], where 
among others, in the Appendix various coordinate systems describing anti de 
Sitter spaces in arbitrary dimensions are discussed). 

Amazingly, the Minkowski spacetime has recently entered the active new 
area of so called pre-big bang string cosmology [38]. String theory is here 
applied to the problem of the big bang. The idea is to start from a simple 
Minkowski space (as an “asymptotic past triviality”) and to show that it is in 
an unstable false-vacuum state, which leads to a long pre-big bangian infla- 
tionary phase. This, at later times, should provide a hot big bang. Although 
such a scenario has been criticized on various grounds, it has attractive fea- 
tures, and most importantly, can be probed through its observable relics [38]. 

Since it is hard to forecast how the roles of these three spacetimes of 
constant curvature will develop in new and exciting theories in the next mil- 
lennium, let us better conclude our “just so” notes by stating three “stable” 
results of complicated, rigorous mathematical analyses of (the classical) Ein- 
stein’s equations. 

In their recent treatise [39], Christodoulou and Klainerman prove that 
any smooth, asymptotically fiat initial data set which is “near fiat, Minkowski 
data” leads to a unique, smooth and geodesically complete solution of Ein- 

® This very simple point was apparently unknown to Einstein in 1917, although 
soon after the publication of his cosmological paper, E. Freundlich and F. Klein 
pointed out to him that an elliptical topology (arising from the identification of 
antipodal points) could have been chosen instead of the spherical one considered 
by Einstein. Although topological questions have been followed with a great in- 
terest in recent decades, the chapter by Geroch and Horowitz in “An Einstein 
Centenary Survey” [31] remains the classic; for more recent texts, see for example 
[32] and references therein. 
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stein’s vacuum equations with vanishing cosmological constant. This demon- 
strates the stability of the Minkowski space with respect to nonlinear (vac- 
uum) perturbations, and the existence of singularity-free, asymptotically flat 
radiative vacuum spacetimes. Christodoulou and Klainerman, however, are 
able to show only a somewhat weaker decay of the field at null infinity than is 
expected from the usual assumption of a sufficient smoothness at null infinity 
in the framework of Penrose (see e.g. [40] for a brief account). 

Curiously enough, in the case of the vacuum Einstein equations with a 
nonvanishing cosmological constant, a more complete picture has been known 
for some time. By using his regular conformal field equations, Friedrich [41] 
demonstrated that initial data sufficiently close to de Sitter data develop into 
solutions of Einstein’s equations with a positive cosmological constant, which 
are “asymptotically simple” (with a smooth conformal infinity), as required 
in Penrose’s framework. More recently, Friedrich [42] has shown the existence 
of asymptotically simple solutions to the Einstein vacuum equations with a 
negative cosmological constant. For the latest review of Friedrich’s thorough 
work on asymptotics, see [43]. 

Summarizing, thanks to these profound mathematical achievements we 
know that the Minkowski, de Sitter, and anti de Sitter spacetimes are the 
solutions of Einstein’s field equations which are stable with respect to general, 
nonlinear (though “weak” in a functional sense) vacuum perturbations. A 
result of this type is not known for any other solution of Einstein’s equations. 

1.4 On the Interpretation and Characterization of Metrics 

Suppose that a metric satisfying Einstein’s field equations is known in some 
region of spacetime and in a given coordinate (reference) system x^. A funda- 
mental question, frequently “forgotten” to be addressed in modern theories 
which extend upon general relativity, is whether the metric tensor gap{x^) 
is a measurable quantity. Classical general relativity offers (at least) three 
ways of giving a positive answer, depending on what objects are considered 
as “primitive tools” to perform the measurements. The first, elaborated and 
emphasized primarily by Mpller [44] , employs standard rigid rods in the mea- 
surements. However, a “rigid rod” is not really a simple primitive concept. 
The second procedure, due to Synge [45], accepts as the basic concepts a 
“particle” and a “standard clock” . If and x^ + dx^ are two nearby events 
contained in the worldline of a clock, then the separation (the spacetime in- 
terval) between the events is equal to the interval measured by the clock. The 
main drawback of this approach appears to lie in the fact that it does not 
explain why the same functions gapix^) describe the behavior of the clock as 
well as paths of free particles, as explained in more detail by Ehlers, Pirani 
and Schild [46], in the motivation for their own axiomatic but constructive 
procedure for setting up the spacetime geometry. Their method, inspired by 
the work of Weyl and others, uses neither rods nor clocks, but instead, light 
rays and freely falling test particles, which are considered as basic tools for 
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measuring the metric aird determiiring the spacetime geometry. (For a simple 
descriptioir of how this cair be performed, see exercise 13.7 in [18]; for some 
irew developments which build upon, amoirg others, the Ehlers-Pirani-Sachs 
approach, see [47].) After iirdicatiirg that the metric tensor is a measurable 
quantity let us briefly turn to the role of spacetime coordinates. 

In special relativity there are infinitely luany global inertial coordiirate 
systems labelliirg events in the Miirkowski manifold they are related by el- 
emeirts of the Poincare group. The iirertial coordiirates labels 
of a given event do not thus have intrinsic meaning. However, the space- 
time interval between two events, determined by the Minkowski metric 77 ^ 1 ^, 
represents an intrinsic property of spacetime. Since the Minkowski metric 
is so simple, the differences between inertial coordinates can have a metri- 
cal meaning (recall Einstein’s reply to Abraham meirtioned in Sect. 1.2). In 
principle, however, both in special and general relativity, it is the metric, 
the line eleiuent, which exhibits intrinsically the geometry, and gives all rel- 
evant information. As Misner [48] puts it, if you write down for someone the 
Schwarzschild metric in the “canonical” form (equation (2) in Sect. 2.2) and 
receive the reaction “that [it] tells me the Pfn, gravitational potentials, now 
tell me in which {t,r,9,(f) coordinate system they have these values?”, then 
there are two valid responses: (a) indicate that it is an indelicate and unneces- 
sary question, or (b) ignore it. Clearly, the Schwarzschild metric describes the 
geometrical properties of the coordinates used in (2). For example, it implies 
that worldlines with fixed r, 9, (p are timelike at r > 2M, orthogonal to the 
lines with t = constant. It determines local null cones (given by ds'^ = 0), i.e. 
the causal structure of the spacetime. In addition, in Schwarzschild coordi- 
nates the metric ( 2 ) indicates how to measure the radial coordinate of a given 
event, because the proper area of the sphere going through the event is given 
just by the Euclidean expression Airr'^ (r is thus often called “the curvature 
coordinate”). On each sphere the angular coordinates 9,ip have the same 
meaning as on a sphere in Euclidean space. The Schwarzschild coordinate 
time t, geometrically preferred by the timelike (for r > 2M) Killing vector, 
which is just equal to d/dt, can be measured by radar signals sent out from 
spatial infinity (r 2M) where t is the proper time (see e.g. [18]). The coor- 
dinates used in ( 2 ) are in fact “more unique” than the inertial coordinates in 
Minkowski spacetime, because the only possible continuous transformations 
preserving the form ( 2 ) are rigid rotations of a sphere, and t ^ t + constant. 
Such a simple interpretation of coordinates is exceptional. However, the sim- 
ple case of the Schwarzschild metric clearly demonstrates that all intrinsic 
information is contained in the line element. 

It is interesting, and for some purposes useful, to consider not just one 
Schwarzschild metric with a given mass M but the family of such metrics 
for all possible M. In order to cover also the future event horizon let us 
describe the metrics by using Eddington-Finkelstein ingoing coordinates as 
in equation (4), Sect. 2.3. This equation can be interpreted as a family of 
metrics with various values of M given on a fixed background manifold M. 1 , 
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with V G M,r G {0, oo), and 9 G [0, tt], € [0, 2tt). Alternatively, however, we 
may use, for example, the Kruskal null coordinates U, V in which the metric 
is given by equation (6), Sect. 2.3, with U = V — U,V = V + U. We may 
then consider metrics on a background manifold M 2 given hy IJ G M,V G 
(0, 00 ), 9 G [0,7t], and (p G [0, 27t), which corresponds to M\. However, these 
two background manifolds are not the same: the transformation between the 
Eddington-Finkelstein coordinates and the Kruskal coordinates is not a map 
from Ml to M 2 because it depends on the value of mass M. Therefore, 
the “background manifold” used frequently in general relativity, for example 
in problems of conservation of energy, or in quantum gravity, is not defined 
in a natural, unique manner. The above simple pedagogical observation has 
recently been made in connection with gauge fixing in quantum gravity by 
Hajfcek [49] in order to explain the old insight by Bergmann and Komar, that 
the gauge group of general relativity is much larger than the diffeomorphism 
group of one manifold. To identify points when working with backgrounds, 
one usually fixes coordinates in all solution manifolds by some gauge condi- 
tion, and identifies those points of all these manifolds which have the same 
value of the coordinates. 

Returning back to a single solution {MjQap), described by a manifold M 
and a metric gap in some coordinates, a notorious (local) “equivalence prob- 
lem” often arises. A given (not necessarily global) solution has the variety of 
representations which equals the variety of choices of a 4-dimensional coor- 
dinate system. Transitions from one choice to another are isomorphic with 
the group of 4-dimensional diffeomorphisms which expresses the general co- 
variance of the theory.^ Given another set of functions g(,^(a;'''') which satisfy 
Einstein’s equations, how do we learn that they are not just transformed com- 
ponents of the metric In 1869 E. B. Christoffel raised a more general 

question: under which conditions is it possible to transform a quadratic form 
gap{x'^)dx°‘dx^ in n-dimensions into another such form g'^p{x'"^)dx'°‘dx'd by 
means of smooth transformation x'^{x"^)7 As Ehlers emphasized in his paper 

^ As pointed out by Kretschmann soon after the birth of general relativity, one can 
always make a theory generally covariant by taking more variables and insert- 
ing them as new dynamical variables into the (enlarged) theory. Thus, standard 
Yang-Mills theory is covariant with respect to the transformations of Yang-Mills 
potentials, corresponding to a particular group, say SU{2). However, the the- 
ory is usually formulated on a fixed background spacetime with a given metric. 
The evolution of a dynamical Yang-Mills solution is thus “painted” on a given 
spacetime. When the metric - the gravitational held - is incorporated as a dy- 
namical variable in the Einstein-Yang-Mills theory, the whole spacetime metric 
and Yang-Mills held are “built-up” from given data (cf. the article by Friedrich 
and Rendall in this volume). The resulting theory is covariant with respect to 
a much larger group. The dual role of the metric, determined only up to 4- 
dimensional diffeomorphisms, makes the character of the solutions of Einstein’s 
equations unique among solutions of other held theories, which do not consider 
spacetime as being dynamical. 
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[50] on the meaning of Christoffel’s equivalence problem in modern field the- 
ories, Christoffel’s results apply to metrics of arbitrary signature, and can be 
thus used directly in general relativity. Without going into details let us say 
that today the solution to the equivalence problem as presented by Cartan is 
most commonly used. For both metrics gajs and one has to find a frame 
(four 1-forms) in which the frame metric is constant, and find the frame com- 
ponents of the Riemann tensor and its covariant derivatives up to - possibly 
- the 10th order. The two metrics gap and g'^^ are then equivalent if and 
only if there exist coordinate and Lorentz transformations under which one 
whole set of frame components goes into the other. In a practical algorithm 
given by Karlhede [51], recently summarized and used in [52], the number of 
derivations required is reduced. 

A natural first idea of how to solve the equivalence problem is to employ 
the scalar invariants from the Riemann tensor and its covariant derivatives. 
This, however, does not work. For example, in all Petrov type N and III 
nonexpanding and nontwisting solutions all these invariants vanish as shown 
recently (see Sect. 8.2), as they do in Minkowski spacetime. 

However, even without regarding invariants, at present much can be learnt 
about an exact solution (at least locally) in geometrical terms, without ref- 
erence to special coordinates. This is thanks to the progress started in the 
late 1950s, in which the group of Pascual Jordan in Hamburg has played the 
leading role, with Jurgen Fillers as one of its most active members. Ehlers’ 
dissertation® [54] from 1957 is devoted to the characterization of exact solu- 
tions. 

The problem of exact solutions also forms the content of his contribution 
to the Royaumont GR-conference [55], as well as his plenary talk in the Lon- 
don GR-conference [56] . A detailed description of the results of the Hamburg 
group on invariant geometrical characterization of exact solutions by using 
and developing the Petrov classification of Weyl’s tensors, groups of isome- 
tries, and conformal transformations are contained in the first paper [57] in 
the (today “golden oldies” ) series of articles published in the “Abhandlungen 
der Akademie der Wissenschaften in Mainz” . An English version, in a some- 
what shorter form, was published by Ehlers and Kundt [53] in the “classic” 
1962 book “Gravitation: An Introduction to Gurrent Research” compiled by 
L. Witten. (We shall meet these references in the following sections.) In the 
second paper of the “Abhandlungen” [58], among others, algebraically spe- 
cial vacuum solutions are studied, using the formalism of the 2-component 
spinors, and in particular, geometrical properties of the congruences of null 
rays are analyzed in terms of their expansion, twist, and shear. 

® The English translation of the title of the dissertation reads: “The construction 
and characterization of the solutions of Einstein’s gravitational field equations” . 
In [53] the original German title is quoted, as in our citation [54], but “of the 
solutions” is erroneously omitted. This error then reemerges in the references in 
[19]. 
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These tools became essential for the discovery by Roy Kerr in 1963 of the 
solution which, when compared with all other solutions of Einstein’s equa- 
tions found from the beginning of the renaissance of general relativity in the 
late 1950s until today, has played the most important role. As Chandrasekhar 
[59] eloquently expresses his wonder about the remarkable fact that all sta- 
tionary and isolated black holes are exactly described by the Kerr solution: 
“This is the only instance we have of an exact description of a macroscopic 
object. Macroscopic objects, as we see them all around us, are governed by 
a variety of forces, derived from a variety of approximations to a variety of 
physical theories. In contrast, the only elements in the construction of black 
holes are our basic concepts of space and time ...” The Kerr solution can also 
serve as one of finest examples in general relativity of “the incredible fact 
that a discovery motivated by a search after the beautiful in mathematics 
should find its exact replica in Nature...” [60]. 

The technology developed in the classical works [53,57], and in a number 
of subsequent contributions, is mostly concerned with the local geometrical 
characterization of exact spacetime solutions. A well-known feature of the 
solutions of Einstein’s equations, not shared by solutions in other physical 
theories, is that it is often very complicated to analyze their global proper- 
ties, such as their extensions, completeness, or topology. If analyzed globally, 
almost any solution can tell us something about the basic issues in general 
relativity, like the nature of singularities, or cosmic censorship. 

1.5 The Choice of Solutions 

Since most solutions, when properly analyzed, can be of potential interest, 
we are confronted with a richness of material which puts us in danger of men- 
tioning many of them, but remaining on a general level, and just enumerating 
rather than enlightening. In fact, because of lack of space (and of our under- 
standing) we shall have to adopt this attitude in many places. However, we 
have selected some solutions, hopefully the fittest ones, and when discussing 
their role, we have chosen particular topics to be analyzed in some detail, 
and left other issues to brief remarks and references. 

Firstly, however, let us ask what do we understand by the term “exact 
solution”. In the much used “exact-solution-book” [61], the authors “do not 
intend to provide a definition”, or, rather, they have decided that what they 
“chose to include was, by definition, an exact solution”. A mathematical 
relativist-purist would perhaps consider solutions, the existence of which has 
been demonstrated in the works of Friedrich or Christodoulou and Klainer- 
man, mentioned at the end of Sect. 1.3, as “good” as the Schwarzschild metric. 
Most recently, Penrose [62] presented a strong conjecture which may lead to a 
general vacuum solution described in the complicated (complex) formalism of 
his twistor theory. Although in this article we do not mean by exact solutions 
those just mentioned, we also do not consider as exact solutions only those 
explicit solutions which can be written in terms of elementary functions on 
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half of a page. We prefer, recalling Feynman, simple “special cases”, but we 
also discuss, for example, the late-time behaviour of the Robinson-Trautman 
solutions for which rigorously convergent series expansions can be obtained, 
which provide sufficiently rich “special information” . 

Concerning the selection of the solutions, the builder of general relativity 
and the gravitational field equations (III) himself indicates which solutions 
should be preferred [63] : “The theory avoids all internal discrepancies which 
we have charged against the basis of classical mechanics... But, it is similar 
to a building, one wing of which is made of fine marble (left part of the 
equation), but the other wing of which is built of low grade wood (right side 
of equation). The phenomenological representation of matter is, in fact, only 
a crude substitute for a representation which would correspond to all known 
properties of matter. There is no difficulty in connecting Maxwell’s theory... 
so long as one restricts himself to space, free of ponderable matter and free 
of electric density...” 

Of course, Einstein was not aware when he was writing this of Yang- 
Mills-Higgs fields, or of the dilaton field, etc. However, remaining on the 
level of field theories with a clear classical meaning, his view has its strength 
and motivates us to prefer (electro)vacuum solutions. A physical interpre- 
tation of the vacuum solutions of Einstein’s equations have been reviewed 
in papers by Bonnor [64], and Bonnor, Griffiths and MacCallum [65] five 
years ago. Our article, in particular in emphasizing and describing the role 
of solutions in giving rise to various concepts, conjectures, and methods of 
solving problems in general relativity, and in the astrophysical impacts of the 
solutions, is oriented quite differently, and gives more detail. However, up to 
some exceptions, like, for example, metrics for an infinite line-mass or plane, 
which are discussed in [64], and new solutions which have been discovered 
after the reviews [64,65] appeared as, for example, the solution describing a 
rigidly rotating thin disk of dust, our choice of solutions is similar to that of 
[64,65]. 

In selecting particular topics for a more detailed discussion we will be 
led primarily by following overlapping aspects: (i) the “commonly acknowl- 
edged” significance of a solution - we will concentrate in particular on the 
Schwarzschild, the Kerr, the Taub-NUT, and plane wave solutions, and (ii) 
the solutions and their properties that I (and my colleagues) have been di- 
rectly interested in, such as the Reissner-Nordstrom metric, vacuum solu- 
tions outside rotating disks, or radiative solutions such as cylindrical waves, 
Robinson-Trautman solutions, and the boost-rotation symmetric solutions. 
Some of these have also been connected with the interests of Jurgen Ehlers, 
and we shall indicate whenever we are aware of this fact. 

Vacuum cosmological solutions are discussed in less detail than they de- 
serve. A possible excuse - from the point of view of being a relativist, a rather 
unfair one - could be that a special recent issue of Reviews of Modern Physics 
(Volume 71, 1999), marking the Centennial of the American Physical Soci- 
ety, contains discussion of the Schwarzschild, the Reissner-Nordstrom and 
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other black hole solutions, and even remarks on the work of Bondi et al. [66] 
on radiative solutions, but among the cosmological solutions only the stan- 
dard models are mentioned. A real reason is the author’s lack of space, time, 
and energy. In the concluding remarks we will try to list at least the most 
important solutions (not only the Friedmann models!) which have not been 
“selected” and give references to the literature in which more information 
can be found. 



1.6 The Outline 

Since the titles of the following sections characterize the contents rather 
specifically, we restrict ourselves to only a few explanatory remarks. In our 
discussion of the Schwarschild metric, after mentioning its role in the solar 
system, we indicate how the Schwarzschild solution gave rise to such concepts 
as the event horizon, the trapped surface, and the apparent horizon. We pay 
more attention to the concept of a bifurcate Killing horizon, because this 
is usually not treated in textbooks, and in addition, Jurgen Ehlers played 
a role in its first description in the literature. Another point which has not 
received much attention is Penrose’s nice presentation of evidence against 
Lorentz-covariant field theoretical approaches to gravity, based on analysis 
of the causal structure of the Schwarzschild spacetime. Among various astro- 
physical implications of the Schwarzschild solution we especially note recent 
suggestions which indicate that we may have evidence of the existence of 
event horizons, and of a black hole in the centre of our Galaxy. 

The main focus in our treatment of the Reissner-Nordstrom metric is 
directed to the instability of the Cauchy horizon and its relation to the cosmic 
censorship conjecture. We also briefly discuss extreme black holes and their 
role in string theory. 

About the same amount of space as that given to the Schwarzschild so- 
lution is devoted to the Kerr metric. After explaining a few new concepts 
the metric inspired, such as locally nonrotating frames and ergoregions, we 
mention a number of physical processes which can take place in the Kerr back- 
ground, including the Penrose energy extraction process, and the Blandford- 
Znajek mechanism. In the section on the astrophysical evidence for a Kerr 
metric, the main attention is paid to the broad iron line, the character of 
which, as most recent observations indicate, is best explained by assuming 
that it originates very close to a maximally rotating black hole. The dis- 
cussion of recent results on black hole uniqueness and on multi-black hole 
solutions concludes our exposition of spacetimes representing black holes. In 
the section on axisymmetric fields and relativistic disks a brief survey of var- 
ious static solutions is first given, then we concentrate on relativistic disks 
as sources of the Kerr metric and other stationary fields; in particular, we 
summarize briefly the recent work on uniformly rotating disks. 
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An intriguing case of Taub-NUT space is introduced by a new constructive 
derivation of the solution. Various pathological features of this space are then 
briefly listed. 

Going over to radiative spacetimes, we analyze in some detail plane waves 
- also in the light of the thorough study by Ehlers and Kundt [53]. Some 
new developments are then noted, in particular, impulsive waves generated 
by boosting various “particles”, their symmetries, and recent use of the 
Colombeau algebra of generalized functions in the analyses of impulsive 
waves. A fairly detailed discussion is devoted to various effects connected 
with colliding plane waves. 

In our treatment of cylindrical waves we concentrate in particular on two 
issues: on the proof that these waves provide explicitly given spacetimes, 
which admit a smooth global null infinity, even for strong initial data within 
a (2+ l)-dimensional framework; and on the role that cylindrical waves have 
played in the first construction of a midisuperspace model in quantum grav- 
ity. Various other developments concerning cylindrical waves are then sum- 
marized only telegraphically. 

A short section on Robinson-Trautman solutions points out how these 
solutions with a nonvanishing cosmological constant can be used to give an 
exact demonstration of the cosmic no-hair conjecture under the presence of 
gravitational radiation, and also of the existence of an event horizon which 
is smooth but not analytic. 

As the last class of radiative spacetimes we analyze the boost-rotation 
symmetric solutions representing uniformly accelerated objects. They play a 
unique role among radiative spacetimes since they are asymptotically flat, 
in the sense that they admit global smooth sections of null infinity. And as 
the only known radiative solutions describing finite sources they can provide 
expressions for the Bondi mass, the news function, or the radiation patterns in 
explicit forms. They have also been used as test-beds in numerical relativity, 
and as the model spacetimes describing the production of black hole pairs in 
strong fields. 

Vacuum cosmological solutions such as the vacuum Bianchi models and 
Gowdy solutions are mentioned, and their significance in the development of 
general relativity is indicated in the last section. Special attention is paid to 
their role in understanding the behaviour of a general model near an initial 
singularity. 

In the concluding remarks, several important, in particular non-vacuum 
solutions, which have not been included in the main body of the paper, are 
at least listed, together with some relevant references. A few remarks on the 
possible future role of exact solutions ends the article. 

Although we give over 360 references in the bibliography, we do not at 
all pretend to give all relevant citations. When discussing more basic facts 
and concepts, we quote primarily textbooks and monographs. Only when 
mentioning more recent developments do we refer to journals. The complete 
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titles of all listed references will hopefully offer the reader a more complete 
idea of the role the explicit solutions have played on the relativistic stage and 
in the astrophysical sky. 



2 The Schwarzschild Solution 

In his thorough “Survey of General Relativity Theory” [67], Jiirgen Ehlers 
begins with an empirical motivation of the theory, goes in depth and detail 
through his favourite topics such as the axiomatic approach, kinetic theory, 
geometrical optics, approximation methods, and only in the last section turns 
to spherically symmetric spacetimes. As T. S. Eliot says, “to make an end is 
to make a beginning - the end is where we start from” , and so here we start 
with a few remarks on spherical symmetry. 

2.1 Spherically Symmetric Spacetimes 

In the early days of general relativity spherical symmetry was introduced in 
an intuitive manner. It is because of the existence of exact solutions which 
are singular at their centres (such as the Schwarzschild or the Reissner- 
Nordstrom solutions) , and a realization that spherically symmetric, topolog- 
ically non-trivial smooth spacetimes without any centre may exist [68], that 
today the group-theoretical definition of spherical symmetry is preferred (for 
a detailed analysis, see e.g. [19,26,67]). 

Following Ehlers [67], we define a spacetime {M,ga/ 3 ) to be spherically 
symmetric if the rotation group SO 3 acts on (A 4 ,ga/ 3 ) as an isometry group 
with simply connected, complete, spacelike, 2-dimensional orbits. One can 
then prove the theorem [67,69] that a spherically symmetric spacetime is the 
direct product A4 = S'^ x N, where is the 2-sphere manifold with the 
standard metric gs on the unit sphere; and is a 2-dimensional manifold 
with a Lorentzian (indefinite) metric g^, and with a scalar r such that the 
complete spacetime metric g^^ is “conformally decomposable”, i.e. r~^gaj 3 
is the direct sum of the 2-dimensional parts gjq and gs- Leaving further 
technicalities aside (see e.g. [26,67,69]) we write down the final spherically 
symmetric line element in the form 

ds^ = + e^^dr'^ + r‘^{d9‘^ + sin^ 9 dip'^), (1) 

where (following [67]) we permit 4>{r,t) and A(r, t) to have an imaginary part 
iTr/2 so that the signs of dt^ and dr^ in (1), and thus the role of r and t 
as space- and time- coordinates may interchange (a lesson learned from the 
vacuum Schwarzschild solutions - see below). The “curvature coordinate” 
r is defined invariantly by the area, 47rr^, of the 2-spheres r = constant, 
t = constant. There is no a priori relation between r and the proper distance 
from the centre (if there is one) to the spherical surface. 
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2.2 The Schwarzschild Metric and Its Role in the Solar System 

Starting from the line element (1) and imposing Einstein’s vacuum field equa- 
tions, but allowing spacetime to be in general dynamical, we are led uniquely 
(cf. BirkhofPs theorem discussed e.g. in [18,26]) to the Schwarzschild metric 



where M = constant has to be interpreted as a mass, as test particle orbits 
show. The resulting spacetime is static at r > 2M (no spherically symmetric 
gravitational waves exist), and asymptotically flat at r — >■ oo. 

Undoubtedly, the Schwarzschild solution, describing the exterior gravi- 
tational field of an arbitrary - static, oscillating, collapsing or expanding - 
spherically symmetric body of (Schwarzschild) mass M, is among the most 
influential solutions of the gravitational field equations, if not of any type 
of field equations invented in the 20th century. It is the first exact solution 
of Einstein’s equations obtained -- by K. Schwarzschild in December 1915, 
still before Einstein’s theory reached its definitive form and, independently, 
in May 1916, by J. Droste, a Dutch student of H. A. Lorentz (see [70] for 
comprehensive survey). 

However, in its exact form (involving regions near r fa 2M) the metric (2) 
has not yet been experimentally tested (a more optimistic recent suggestion 
will be mentioned in Sect. 2.6). When in 1915 Einstein explained the perihe- 
lion advance of Mercury, he found and used only an approximate (to second 
order in the gravitational potential) spherically symmetric solution. In order 
to find the value of the deflection of light passing close to the surface of the 
Sun, in his famous 1911 Prague paper, Einstein used just the equivalence 
principle within his “Prague gravity theory”, based on the variable velocity 
of light. Then, in 1915, he obtained this value to be twice as big in general 
relativity, when, in addition to the equivalence principle, the curvature of 
space (determined from (2) to first order in M/r) was taken into account. 

Despite the fact that for the purpose of solar-system observations the 
Schwarzschild metric in the form (2) is, quoting [18], “too accurate”, it has 
played an important role in experimental relativity. Eddington, Robertson 
and others introduced the method of expanding the Schwarzschild metric 
at the order beyond Newtonian theory, and then multiplying each post- 
Newtonian term by a dimensionless parameter which should be determined by 
experiment. These methods inspired the much more powerful PPN (‘Para- 
metrized post- Newtonian”) formalism which was developed at the end of the 
1960s and the beginning of the 1970s for testing general relativity and alter- 
native theories of gravity. It has been very effectively used to compare general 
relativity with observations (see e.g. [18,71,72] and references therein). In or- 
der to gain at least some concrete idea, let us just write down the simplest 
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generalization of (2), namely the metric 




dt^ + ( 1 + 2y 



M 



r 




+r^ + sin^ 9 dtp^) , 



( 3 ) 



which is obtained by expanding the metric (2) in Mjr up to one order beyond 
the Newtonian approximation, and multiplying each post-Newtonian term 
by dimensionless parameters which distinguish the post-Newtonian limits of 
different metric theories of gravity, and should be determined experimentally. 
(In general, one needs not just two but ten PPN parameters [18,71,72].) In 
Einstein’s theory: /3 = 7 = 1. Calculating from metric (3) the advance of the 
pericentre of a test particle orbiting a central mass M on an ellipse with semi- 
major axis a and eccentricity e, one finds A(^ = i(2-|-27 — /3)67rM/[a(l — e^)j, 
whereas the total deflection angle of electromagnetic waves passing close to 
the surface of the body is Ai\) = 2(1 -|- where is the radius of 

closest approach of photons to the central body. 

Measurements of the deflection of radio waves and microwaves by the Sun 
(recently also of radio waves by Jupiter) at present restrict 7 to |(1 -I- 7) = 
l.OOOlihO.OOl [71,72]. Planetary radar rangings, mainly to Mercury, give from 
the perihelion shift measurements the result {2j + 2 — P)/3 = 1.00±0.002, so 
that /3 = 1.000 ± 0.003, whereas the measurements of periastron advance for 
the binary pulsar systems such as PSR 1913-1-16 implied agreement with Ein- 
stein’s theory to better than about 1% (see e.g. [71,72] for reviews). There are 
other solar-system experiments verifying the leading orders of the Schwarz- 
schild solution to a high accuracy, such as gravitational redshift, signal retar- 
dation, or lunar geodesic precession. A number of advanced space missions 
have been proposed which could lead to significant improvements in values of 
the PPN parameters, and even to the measurements of post-post-Newtonian 
effects [72]. 

Hence, though in an approximate form, the Schwarzschild solution has 
had a great impact on experimental relativity. In addition, the observational 
effects of gravity on light propagation in the solar system, and also today 
routine observations of gravitational lenses in cosmological contexts [73] , have 
significantly increased our confidence in taking seriously similar predictions 
of general relativity in more extreme conditions. 

2.3 Schwarzschild Metric Outside a Collapsing Star 

I recall how Roger Penrose, at the beginning of his lecture at the 1974 Erice 
Summer School on gravitational collapse, placed two figures side by side. The 
first illustrated schematically the bending of light rays by the Sun (surpris- 
ingly, Penrose did not write “Prague 1911” below the figure). I do not remem- 
ber exactly his second figure but it was similar to Fig. 1 below: the spacetime 
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Fig. 1. The gravitational collapse of a spherical star (the interior of the star is 
shaded). The light cones of the three events, O, P, Q, at the centre of the star, and 
of the three events outside the star are illustrated. The event horizon, the trapped 
surfaces, and the singularity formed during the collapse are also shown. Although 
the singularity appears to lie in a “time direction” , from the character of the light 
cone outside the star but inside the event horizon it is seen that it has a spacelike 
character. 



diagram showing spherical gravitational collapse through the Schwarzschild 
radius into a spherical black hole. 

It is in all modern books on general relativity that the Schwarzschild 
radius at = 2M is the place where Schwarzschild coordinates t, r are 
unsuitable, and that metric (2) has a coordinate singularity but not a physi- 
cal one. One has to introduce other coordinates to extend the Schwarzschild 
metric through Rg. In order to describe all spacetime outside a collapsing 
spherical body it is advantageous to use ingoing Eddington-Finkelstein co- 
ordinates (u, r, 9, (f) where v = t + r + 2Mlog(r/2M — 1). Metric (2) takes 
the form 

ds^ = — (^^2 _|_ 2dvdr + {d9^ + sin^ 9 dtp^) , (4) 

{v,9,(f) = constant are ingoing radial null geodesics. Figure 1, plotted in 
these coordinates, demonstrates well several basic concepts and facts which 
were introduced and learned after the end of 1950s when a more complete un- 
derstanding of the Schwarzschild solution was gradually achieved. The metric 
(4) holds only outside the star, there will be another metric in its interior, for 
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example the Oppenheimer-Snyder collapsing dust solution (i.e. a portion of a 
collapsing Friedmann universe), but the precise form of the interior solution 
is not important at the moment. Consider a series of flashes of light emitted 
from the centre of the star at events O, P, Q (see Fig. 1) and assume that 
the stellar material is transparent. As the Sun has a focusing effect on the 
light rays, so does matter during collapse. As the matter density becomes 
higher and higher, the focusing effect increases. At event P a special wave- 
front will start to propagate, the rays of which will emerge from the surface 
of the star with zero divergence, i.e. the null vector A:“ = dx°‘ /dw, w being 
an affine parameter, tangent to null geodesics, satisfies fcf), = 0. The wave- 
front then “stays” at the hypersurface r = 2M in metric (4), and the area of 
its 2-dimensional cross-section remains constant. The null hypersurface rep- 
resenting the history of this critical wavefront is the (future) event horizon. 
Note that the light cones turn more and more inwards as the event horizon is 
approached. They become tangential to the horizon in such a way that radial 
outgoing photons stay at r = 2M whereas ingoing photons fall inwards, and 
will eventually reach the curvature singularity at r = 0. As Fig. 1 indicates, 
wavefronts emitted still later than the critical one, as for example that emit- 
ted from event Q, will be focused so strongly that their rays will start to 
converge, and will form (closed) trapped surfaces. The light cones at trapped 
surfaces are so turned inwards that both ingoing and outgoing radial rays 
converge, and their area decreases. 

Consider a family of spacelike hypersurfaces A(t) foliating spacetime (r 
is a time coordinate, e.g. v — r). The boundary of the region of A(t) which 
contains trapped surfaces lying in Sir) is called the apparent horizon in S(r). 

In general, the apparent horizon is different from the intersection of the 
event horizon with A(r), as a nice simple example (based again on an exact 
solution) due to Hawking [74] shows. Assume that after the spherical collapse 
of a star a spherical thin shell of mass m surrounding the star collapses 
and eventually crashes at the singularity at r = 0 (Fig. 2). In the vacuum 
region inside the shell there is the Schwarzschild metric (4) with mass M, 
and outside the shell with mass M + m. Hence the apparent horizon on A(ti) 
will be at r = 2M and will remain there until S{t 2 ) when it discontinuously 
jumps to r = 2(M + m). One can determine the apparent horizon on a 
given hypersurface. In order to find the event horizon one has to know the 
whole spacetime solution. The future event horizon separates events which 
are visible from future infinity, from those which are not, and thus forms the 
boundary of a black hole. 

From the above example of a shell collapsing onto a Schwarzschild black 
hole we can also learn about the “teleological” nature of the horizon: the mo- 
tion of the horizon depends on what will happen to the horizon in the future 
(whether a collapsing shell will cross it or not). This teleological behaviour of 
the horizon has later been discovered in a variety of astrophysically realistic 
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situations such as the behaviour of a horizon perturbed by a mass orbiting a 
black hole (see [75] for enlightening discussions of such effects). 

By studying the Schwarzschild solution and spherical collapse it became 
evident that one has to turn to global methods to gain a full understanding of 
general relativity. The intuition acquired from analyzing the Schwarzschild 
metric helped crucially in defining and understanding such concepts as the 
trapped surface, the event horizon, or the apparent horizon in general situ- 
ations without symmetry. Nowadays these concepts are explained in several 
advanced textbooks and monographs (e.g. [18,19,26,32,76]). 

Following from the example of spherical collapse one is led to ask whether 
generic gravitational collapses lead to spacetime singularities and whether 
these are always surrounded by an event horizon. The Penrose-Hawking sin- 
gularity theorems [19,26] show that singularities do arise under quite generic 
circumstances (the occurrence of a closed trapped surface is most significant 
for the appearance of a singularity). The second question is the essence of 
the cosmic censorship hypothesis. Various exact solutions have played a role 
in attempts to “prove” or “disprove” this “one of the most important issues” 
of classical relativity. We shall meet it in several other places later on, in 
particular in Sect. 3.1. There a more detailed formulation is given. 




Fig. 2. The “teleological” behaviour of the event horizon during the gravitational 
collapse of a star, followed by the collapse of a shell. The event horizon moves out- 
wards because it will be crossed by the shell. The apparent horizon moves outwards 
discontinuously (adapted from [74]). 
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2.4 The Schwarzschild Kruskal Spacetime 

In the remarks above we considered the Schwarzschild solution outside a 
static (possibly oscillating, or expanding from r > 2M) star, and outside 
a star collapsing into a black hole. It is not excluded that just these situa- 
tions will turn out to be physically relevant. Nevertheless, in connection with 
the Schwarzschild metric it would be heretical not to mention the enormous 
impact which its maximal vacuum analytic extension into the Schwarzschild- 
Kruskal spacetime has had. This is today described in detail in many places 
(see e.g. [18,19,26,76]). We need two sets of the Schwarzschild coordinates to 
cover the complete spacetime, and we obtain two asymptotically flat spaces, 
i.e. the spacetime with two (“right” and “left”) infinities. The metric in 
Kruskal coordinates U, V, related to the Schwarzschild r, t (in the regions 
with r > 2M) by 

U = ±{r/2M - l)l/2gr/4M (i/4M) , 

V = ±(r/2M - l)i/2gr-/4M (t/4M) , (5) 

takes the form 

ds^ = (-dV^ + dU^) + {dO^ + sin^ 9 V) . (g) 

The introduction of the Kruskal coordinates which remove the singularity of 
the Schwarzschild metric (2) at the horizon r = 2M and cover the complete 
spacetime manifold (every geodesic either hits the singularity or can be con- 
tinued to the infinite values of its affine parameter), was the most influential 
example which showed that one has to distinguish carefully between just a 
coordinate singularity and the real, physical singularity. It also helped us to 
realize that the definition of a singularity itself is a subtle issue in which the 
concept of geodesic completeness plays a significant role (see [77] for a recent 
analysis of spacetime singularities). 

The character of the Schwarzschild-Kruskal spacetime is best seen in the 
Penrose diagram given in Fig. 3, in which the spacetime is compactifled by 
a suitable conformal rescaling of the metric. Both right and left infinities are 
represented, and the causal structure is well illustrated because worldlines 
of radial light signals (radial null geodesics) are 45-degree lines in the dia- 
gram. In particular the black hole region II and a “newly emerged” (as a 
consequence of the analytical continuation) white hole region IV (with the 
white-hole singularity at r = 0) are exhibited. For more detailed analyses 
of the Penrose diagram of the Schwarzschild-Kruskal spacetime the reader 
is referred to e.g. [18,19,26,76]. Here we wish to turn in some detail to two 
very important concepts in black hole theory which were first understood 
by the analytic extension of the Schwarzschild solution, and which are not 
often treated in standard textbooks. These are the concepts of the bifurcate 
horizon and of the horizon surface gravity. Jurgen Ehlers played a somewhat 
indirect, but important and noble part in their introduction into literature. 
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Fig. 3. The Penrose diagram of the compactified Schwarzschild-Kruskal spacetime. 
Radial null geodesics are 45-degrees lines. Timelike geodesics reach the future (or 
past) timelike infinities (or i~), null geodesics reach the future (or past) null 
infinities (or J~) and spacelike geodesics lead to spatial infinities . (Notice 
that at the lines t = constant are tangent to each other - this is often not taken 
into account in the literature - see e.g. [26,30].) 



These concepts were the main subject of the last work of Robert Boyer 
who became one of the victims of a mass murder on August 1, 1966, in 
Austin, Texas. Jurgen Ehlers was authorized by Mrs. Boyer to look through 
the scientific papers of her husband, and together with John Stachel, pre- 
pared posthumously the paper [78] from R. Boyer’s notes. Ehlers inserted his 
own discussions, generalized the main theorem on bifurcate horizons, but the 
paper [78] was published with R. Boyer as the only author. 

In the Schwarzschild spacetime there exists the timelike Killing vector, 
d/dt, which when analytically extended into all Schwarzschild-Kruskal man- 
ifold, becomes null at the event horizon r = 2M, and is spacelike in the 
regions II and IV with r < 2M. In Kruskal coordinates it is given by 

= U/4M, = K/4M, k^ = 0, k^ = Q) . (7) 

Hence it vanishes at all points with U = V = 0, 0 € [0,7r],</3 G [0,27 t). 
These points, forming a spacelike 2-sphere which we denote B (in Schwarz- 
schild coordinates given by r = 2M, t = constant), are fixed points of the 
1-dimensional group G of isometries generated by (see Fig. 3). At the 
event horizon the corresponding 1-dimensional orbits are null geodesics, with 
k°‘ being a tangent vector. However, since vanishes at B, these orbits are 
incomplete. 

This (and similar observations for other black hole solutions) motivated 
a general analysis of the bifurcate Killing horizons given in [78]. There it 
is proven for spacetimes admitting a general Killing vector field which 
generates a 1-dimensional group of isometries, that (i) a 1-dimensional orbit 
is a complete geodesic if the gradient of the square vanishes on the orbit, 
(ii) if a geodesic orbit is incomplete, then it is null and ^ 0. In addition, 

if = dx°‘ / dv (v being the group parameter), the affine parameter along 
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the geodesic is w = where k = constant satisfies 

(-a’“=2<“. (8) 

In the Schwarzschild case, with = fc“ = {d/dt)°‘, and considering the 
part V = U of the horizon, we get k = 1/4M. The relation w = e''’' is 
just the familiar equation V = , where V = V + U is the Kruskal null 

coordinate and v is the Eddington-Finkelstein ingoing null coordinate used 
in (4). (Notice that V is indeed the affine parameter along the null geodesics 
at the horizon V = U .) The quantity k, first introduced in [78], has become 
fundamental in modern black hole theory, and also in its generalizations in 
string theory. It is the well-known surface gravity of the black hole horizon. 

With K yf 0, the limit points corresponding to u — >■ —oo,w = 0 are fixed 
points of G. (Unless the spacetime is incomplete, there exists a continuation 
of each null geodesic beyond these fixed points to u> < 0.) One can show 
that the fixed points form a spacelike 2-dimensional manifold B, given by 
[/ = U = 0 in the Schwarzschild case; this “bifurcation surface” is a totally 
geodesic submanifold. By the original definition [79], a Killing horizon is a G 
invariant null hypersurface N on which = 0. (A recent definition [80,81] 
specifies a Killing horizon to be any union of such hypersurfaces.) If k yf 0, 
at each point of B there is one null direction orthogonal to B which is not 
tangent to N = N U B. The null geodesics intersecting B in these directions 
form another null hypersurface, N, which is also a Killing horizon. The union 
iV U iV is called a bifurcate Killing horizon (Fig. 4). 




Fig. 4. The bifurcate Killing horizon consisting of two null hypersurfaces N and N 
which intersect in the spacelike 2-dimensional “bifurcation surface” B. 



Bifurcate Killing horizons exist also in fiat and other curved spacetimes. 
For example, in the boost-rotation symmetric spacetimes (Sect. 11), null 
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hypersurfaces z = ±t form the bifurcate Killing horizon corresponding to the 
boost Killing vector; B, given by 2 = t = 0, is then not compact. (As in the 
Schwarzschild-Kruskal spacetime, a bifurcate Killing horizon locally divides 
the spacetime into four wedges.) However, the first motivation for analyzing 
Killing horizons came from the black hole solutions. 

Both Killing horizons and surface gravity play an important role in black 
hole thermodynamics and quantum field theory on curved backgrounds [82], 
in particular in their two principal results: the Hawking effect of particle cre- 
ation by black holes; and the Unruh effect showing that a thermal bath of 
particles will be seen also by a uniformly accelerated observer in flat space- 
time when the quantum field is in its vacuum state with respect to inertial 
observers. Recently, new results were obtained [83] which support the view 
that a spacetime representing the final state of a black hole formed by collapse 
has indeed a bifurcate Killing horizon, or the Killing horizon is degenerate 
(k = 0). 

2.5 The Schwarzschild Metric as a Case 

Against Lorentz- Covariant Approaches 

There are many other issues on which the Schwarzschild solution has made an 
impact. Some of astrophysical applications will be very briefly mentioned later 
on. As the last theoretical point in this section I would like to discuss in some 
detail the causal structure of the Schwarzschild spacetime including infinity. 
By analyzing this structure, Penrose [84] presented evidence against various 
Lorentz (Poincare)-covariant field theoretical approaches, which regard the 
physical metric tensor g to be not much different from any other tensor 
in Minkowski spacetime with flat metric rj (see e.g. [85,86]). I thought it 
appropriate to mention this point here, since Jurgen Ehlers, among others, 
certainly does not share a field theoretical viewpoint. 

The normal procedure of calculating the metric g in these approaches is 
from a power series expansion of Lorentz-covariant terms (in quantum the- 
ory this corresponds to an infinite summation of Feynman diagrams). The 
derived field propagation has to follow the true null cones of the curved metric 
g instead of those of rj. However, as Penrose shows, in a satisfactory theory 
the null cones defined by g should not extend outside the null cones de- 
fined by 77 , or “the causality defined by g should not violate the background 
77 -causality”. Following [84], let us write this condition as g < rj. Now at 
first sight we may believe that g < t] is satisfied in the Schwarzschild field 
since its effect is to “slow down” the velocity of light (cf. “signal retarda- 
tion” mentioned in 2.2). However, in the field-theoretical approaches one of 
the main emphasis is in a consistent formulation of scattering theory. This 
requires a good behaviour at infinity. But with the Schwarzschild metric, null 
geodesics with respect to metric g “infinitely deviate” from those with respect 
to rj\ for example, the radial outgoing g null geodesics 9,(p = constant, and 
u = t — r — 2Mlog(r/2M — 1) = constant at r — >■ 00 go “indefinitely far” 
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into the retarded time t — r of rj, and hence, do not correspond to outgoing 
77- null geodesics t — r = constant. One can try to use a different Hat metric 
associated with the Schwarzschild metric g which does not lead to patho- 
logical behaviour at infinity, but then it turns out that g < r] is violated 
locally. In fact, Penrose [84] proves the theorem, showing that there is an 
essential incompatibility between the causal structures in the Schwarzschild 
and Minkowski spacetimes which appears either asymptotically or locally. 

This incompatibility is easily understood with the exact Schwarzschild 
solution, but it is generic, since one is concerned only with the behaviour of 
the space at large distances from a positive-mass source, i.e. with the causal 
properties in the neighbourhood of spacelike infinity 

In the present post-Minkowskian approximation methods for the gener- 
ation of gravitational waves by relativistic sources, a suitable (Bondi-type) 
coordinate system [66] is constructed at all orders in the far wave zone, which 
in particular corrects for the logarithmic deviation of the true light cones with 
respect to the coordinate fiat light cones (cf. contribution by L. Blanchet in 
this volume). 

2.6 The Schwarzschild Metric and Astrophysics 

In his introductory chapter “General Relativity as a Tool for Astrophysics” 
for the Seminar in Bad Honnef in 1996 [87], Jurgen Ehlers remarks that “The 
interest of black holes for astrophysics is obvious... The challenge here is to 
find observable features that are truly relativistic, related, for example, to 
horizons, ergoregions... Indications exist, but - as far as I am aware - no firm 
evidence.” 

There are many excellent recent reviews on the astrophysical evidence 
for black holes (see e.g. [88-90]). It is true, that the evidence points towards 
the presence of dark massive objects - stellar-mass objects in binaries, and 
supermassive objects in the centres of galaxies - which are associated with 
deep gravitational potential wells where Newtonian gravity cannot be used, 
but it does not offer a clear diagnostic of general relativity. 

Many investigations of test particle orbits in the strong-gravity regions 
(r < lOM) have shown basic differences between the motion in the Schwarz- 
schild metric and the motion in the central field in Newton’s theory (e.g. 
[18,76,91]). For example for 3M < r < 6M unstable circular particle orbits 
exist which are energetically unbound, and thus perturbed particles may es- 
cape to infinity; at r = 3M circular photon orbits occur and there are no 
circular orbits for r < 3M. Particles are trapped by a Schwarzschild black 
hole if they reach the region r < 3M. 

About ten years ago, the study of the behaviour of particles and gyro- 
scopes in the Schwarzschild field revived interest in the “classical” problem of 
the definition of gravitational, centrifugal, and other inertial “forces” acting 
on particles and gyros moving on the Schwarzschild or on a more general 
curved backgrounds, usually axisymmetric and stationary (see e.g. [92,93], 
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and many references therein). One would like to have a split of a covariantly 
defined quantity (like an acceleration) into non-covariant parts, the physical 
meaning of which would increase our intuition of relativistic effects in astro- 
physical problems. If, for example, we adopt the view that the “gravitational 
force” is velocity-independent, then we find that at the orbits outside the cir- 
cular photon orbit (r > 3M), the centrifugal force is as in classical physics, 
repulsive, while it becomes attractive inside this orbit, being zero exactly at 
the orbit.® 

Relativistic effects will, of course, play a role in many astrophysical situ- 
ations involving spherical accretion, the structure of accretion disks around 
compact stars and black holes, their optical appearance etc. They have be- 
come an important part of the arsenal of astrophysicists, and they have en- 
tered standard literature (see e.g. [95,96]). Though this whole field of science 
lies beyond the scope of this article, I would like to mention three recent issues 
which provide us with hope that we may perhaps soon meet the challenge 
noted in Jurgen Ehlers’ remarks made in Bad Honnef in 1996. 

The first concerns our Galactic centre. Thanks to new observations of stars 
in the near infrared band it was possible to detect the transverse motions of 
stars (for which the radial velocities are also observed) within 0.1 pc in our 
Galactic centre. The stellar velocities up to 2000 km/sec and their dependence 
on the radial distance from the centre are consistent with a black hole of mass 
2.5 X 10® Mq. In the opinion of some leading astrophysicists, our Galactic 
centre now provides “the most convincing case for a supermassive hole, with 
the single exception of NGG 4258” [88]. (In NGG 4258 a disk is observed 
whose inner edge is orbiting at 1080 km/sec, implying a black hole - “or 
something more exotic” [88] - with a mass of 3.6 x IO^Mq.) Perhaps we shall 
be able to observe relativistic effects on the proper motions of stars in our 
Galactic centre in the not too distant future. 

The second issue concerns the fundamental question of whether observa- 
tions can bring convincing proof of the existence of black hole event hori- 
zons. Very recently some astrophysicists [89] claimed that new observations, 
in particular of X-ray binaries, imply such evidence. The idea is that thin 
disk accretion cannot explain the spectra of some of X-ray binaries. One has 
to use a different accretion model, a so called advection-dominated accretion 
flow model (ADAF) in which most of the gravitational energy released in 
the infalling gas is carried (advected) with the flow as thermal energy, which 
falls on the central object. (In thin disks most of this energy is radiated out 
from the disk.) If the central compact object (for example a neutron star) 
has a hard surface, the thermal energy stored in the flow is re-radiated after 
the flow hits the surface. However, some of the X-ray binaries show such low 
luminosities that a very large fraction of the energy in the flow must be ad- 

® Curiously enough, Feynman in his 1962-63 lectures on gravitation [94] writes 
that “inside r = 2M [not 3M!|... the ‘centrifugal force’ apparently acts as an 
attraction rather than a repulsion” . 
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vected through an event horizon into a black hole [89]. Although Rees [88], 
for example, considers this evidence “gratifyingly consistent with the high- 
mass objects in binaries being black holes”, he believes that it “would still 
not convince an intelligent sceptic, who could postulate a different theory 
of strong-field gravity or else that the high-mass compact objects were (for 
instance) self-gravitating clusters of weakly interacting particles...”. 

For a sceptical optimistic relativist, the most challenging observational 
issue related to black holes probably is to find astrophysical evidence for a 
Kerr metric. We shall come to this point in Sect. 4.3. 

The last (but certainly not the least) issue lies more in the future, but 
eventually should turn out to be most promising. It is connected with both 
the Numerical Relativity Great Challenge Alliance and the “great challenge” 
of experimental relativity: to calculate reliable gravitational wave-forms and 
to detect them. When gravitational waves from stars captured by a super- 
massive black hole, or from a newly forming supermassive black hole, or, 
most importantly, from coalescing supermassive holes will be detected and 
compared with the predictions of the theory, we should learn significant facts 
about black holes [88,97]. Are these so general remarks entirely inappropriate 
in the section on the Schwarzschild solution? 

One of the most important roles of the Schwarzschild solution in the 
development of mathematical relativity and especially of relativistic astro- 
physics stems from its simplicity, in particular from its spherical symmetry. 
This has enabled us to develop the mathematically beautiful theory of lin- 
ear perturbations of the Schwarzschild background and employ it in various 
astrophy sic ally realistic situations (see e.g. [75,76,91], and many references 
therein). Surprisingly enough, this theory does not only give reliable results 
in such problems as the calculation of waves emitted by pulsating neutron 
stars, or waves radiated out from stars falling into a supermassive black hole. 
Very recently we have learned that one can use perturbation theory of a single 
Schwarzschild black hole as a “close approximation” to black hole collisions. 
Towards the end of the collision of two black holes, they will not in fact be 
two black holes, but will merge into a highly distorted single black hole [98]. 
When compared with the numerical results on a head-on collision it has been 
found that this approximation gives predictions for separations A as large as 
A/M - 7. 



3 The Reissner— Nordstrom Solution 

This spherically symmetric solution of the Einstein-Maxwell equations was 
derived independently^^ by H. Reissner in 1916, H. Weyl in 1917, and G. 

In the literature one finds the solution to be repeatedly connected only with 
the names of Reissner and Nordstrom, except for the “exact-solutions-book” 
[61]: there in four places the solution is called as everywhere else, but in one 
place (p. 257) it is referred to as the “Reissner- Weyl solutions” . An enlightening 
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Nordstrom in 1918. It represents a spacetime with no matter sources except 
for a radial electric field, the energy of which has to be included on the right- 
hand side of the Einstein equations. Since Birkhoff’s theorem, mentioned in 
connection with the Schwarzschild solution in Sect. 2.2, can be generalized 
to the electrovacuum case, the Reissner-Nordstrom solution is the unique 
spherical electrovacuum solution. Similarly to the Schwarzschild solution, it 
thus describes the exterior gravitational and electromagnetic fields of an ar- 
bitrary - static, oscillating, collapsing or expanding - spherically symmetric, 
charged body of mass M and charge Q. The metric reads 








dr + \1 



2M 

r 



+ (dd^ -I- sin^ 9 d(p‘^) , 




(9) 



the electromagnetic field in these spherical coordinates is described by the 
“classical” expressions for the time component of the electromagnetic poten- 
tial and the (only non-zero) component of the electromagnetic field tensor: 

= Ft, = -Frt = -^. ( 10 ) 

A number of authors have discussed spherically symmetric, static charged 
dust configurations producing a Reissner-Nordstrom metric outside, some of 
them with a hope to construct a “classical model” of a charged elementary 
particle (see [61] for references). The main influence the metric has exerted on 
the developments of general relativity, and more recently in supersymmetric 
and superstring theories (see Sect. 3.2), is however in its analytically extended 
electrovacuum form when it represents charged, spherical black holes. 



3.1 Reissner Nordstrom Black Holes 

and the Question of Cosmic Censorship 

The analytic extensions have qualitatively different character in three cases, 
depending on the relationship between the mass M and the charge Q. In 
the case (corresponding, for example, to the field outside an elec- 

tron), the complete electrovacuum spacetime is covered by the coordinates 
{t,r,9,ip), 0 < r < oo. There is a naked singularity (visible from infinity) at 
r = 0 in which the curvature invariants diverge. If < M^, the metric (9) 

discussion on p. 209 in [61] shows that the solution belongs to a more general 
“Weyl’s electrovacuum class” of electrostatic solutions discovered by Weyl (in 
1917) which follow from an Ansatz that there is a functional relationship between 
the gravitational and electrostatic potentials. As will be noticed also in the case 
of cylindrical waves in Sect. 9, if “too many” solutions are given in one paper, the 
name of the author is not likely to survive in the name of an important subclass... 
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describes a (generic) Reissner-Nordstrom black hole] it becomes singular at 
two radii: 

r = r±=M±{M^ (11) 

Similarly to the Schwarzschild case, these are only coordinate singularities. 
Graves and Brill [99] discovered, however, that the analytic extension and 
the causal structure of the Reissner-Nordstrom spacetime with is 

fundamentally different from that of the Schwarzschild spacetime. There are 
two null hypersurfaces, at r = r+ and r = r-, which are known as the outer 
(event) horizon and the inner horizon] the Killing vector d/dt is null at the 
horizons, timelike at r > r_|_ and r < r_, but spacelike at r_ < r < r_|_. 
The character of the extended manifold is best seen in the Penrose diagram 
in Fig. 5, in which the spacetime is compactified by a suitable conformal 
rescaling of the metric (see, e.g. [18,26,30]). As in the compactified Kruskal- 
Schwarzschild diagram in Fig. 3, the causal structure is well illustrated be- 
cause worldlines of radial light signals are 45-degree lines. There are again 
two infinities illustrated - the right and left - in regions I and III. However, 
the maximally extended Reissner-Nordstrom geometry consists of an infinite 
chain of asymptotic regions connected by “wormholes” between the real sin- 
gularities (with divergent curvature invariants) at r = 0. In Fig. 5, the right 
and left (past null) infinities in regions /' and III' are still seen - the others 
are obtained by extending the diagram vertically in both directions. 

An important lesson one has learned is that the character of the singu- 
larity need not be spacelike as it is in the Schwarzschild case, or with the 
big bang singularities in standard cosmological models. Indeed, the singu- 
larities in the Reissner-Nordstrom geometry are timelike: they do not block 
the way to the future. By solving the geodesic equation one can show that 
there are test particles which start in “our universe” (region T), cross the 
outer horizon at r = and the inner horizon at r = r_, avoid the singu- 
larity and through a “white hole” (the outer horizon between regions IV 
and I') emerge into “another universe” I' with its own asymptotically flat 
region. Such a gravitational bounce can occur not only with test particles. The 
studies of the gravitational collapse of charged spherical shells ([100] and ref- 
erences therein) and of charged dust spheres ([101] and references therein) 
have shown that a bounce can take place also in fully dynamical cases. The 
part of Fig. 5 which is “left” from the worldline of the surface of the sphere 
or the shell is “covered” by the interior of the sphere or flat space inside 
the shell. As observed in [100], the outcome of the bounce of a shell can be 

An intuitive explanation [101] of this bounce is that as the sphere (the shell) 
contracts, the volume of the exterior region increases, and hence also the total 
energy in the electric field, which eventually exceeds the energy in the sphere. 
However, the external plus internal energy does not change during collapse (there 
are no waves), and so in the neighborhood of a highly contracted charged object, 
the gravitational held must have a repulsive character corresponding to a negative 
mass-energy. 
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Fig. 5. The compactified Reissner-Nordstrom spacetime representing a non- 
extreme black hole consists of an indefinite chain of asymptotic regions ( “universes” ) 
connected by “wormholes” between timelike singularities. The worldline of a shell 
collapsing from “universe” / and re-emerging in “universe” I' is indicated. The 
inner horizon at r = r_ is the Cauchy horizon for spacelike hypersurface E. It is 
unstable and will thus very likely prevent such a process occuring. 



different, depending on the value of the shell’s total mass, charge and rest 
mass. The shell may crash into the “right” singularity or it may continue to 
expand and emerge in region I' . If the rest mass of the shell is negative the 
collapse may even lead to a naked singularity. 

Now even if the shell collapses into a black hole, and after a bounce, 
emerges in region a locally naked singularity is present: the timelike sin- 
gularity at r = 0, to the “right” from the wordline of the shell. An observer 
travelling into the future “between” the shell and the singularity can be sur- 
prised by a signal coming from the singularity (see Fig. 5). Penrose’s strong 
cosmic censorship conjecture (see e.g. [102]) suggests that this should not 
happen. In its physical formulation, as given by Wald (see [19] also for the 
precise formulation), it says that all physically reasonable spacetimes are 
globally hyperbolic, i.e. apart from a possible initial (big bang-type) singu- 
larity, no singularity is visible to any observer. It was just the example of the 
Reissner-Nordstrom solution (and a similar property of the Kerr solution) 
which inspired Penrose to formulate the strong cosmic censorship conjec- 
ture, in addition to its weak version which only requires that from generic 
nonsingular initial data on a Cauchy hypersurface no spacetime singularity 
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develops which is visible from infinity. As Penrose [102] puts it, “it seems to 
be comparatively unimportant whether the observer himself can escape to 
infinity”. 

It is evident from Fig. 5 that the Reissner-Nordstrom spacetime is not 
globally hyperbolic, i.e. it does not possess a Cauchy hypersurface E, the 
initial data on which (for a test field, say) would determine the development 
of data in the entire future. If data are given on the spacelike hypersurface 
E “connecting” left and right infinities of regions III and /, the Cauchy 
development will predict what happens only in regions III and I above E, and 
in region //, i.e. not beyond the null hypersurfaces (inner horizons) r = r- 
between region II and regions V in the figure. The inner horizons r = r_ 
represent the Cauchy horizon for a typical initial hypersurface like E. As 
noticed above, what is happening at an event in regions V is in general 
influenced not only by data on E but also by what is happening at the 
(locally) naked singularities (which cannot be predicted since the physics at 
a singularity cannot be controlled). 

Penrose was also the first who predicted that the inner (Cauchy) horizon 
is unstable [27]. If this is true, a null singularity, or possibly even a spacelike 
singularity may arise during a general collapse, so preventing a violation of the 
strong cosmic censorship conjecture. The instability of the Cauchy horizon 
can in fact be expected by using first the following simple geometrical-optics 
argument. 

Introduce the ingoing Eddington-Finkelstein null coordinate v by 
v = t + r* = t + j f{r)dr, 

f=l-2M/r + Q^/rfi (12) 

which brings the metric (9) into the form as of equation (4) in the Schwarz- 
schild case. Consider a freely falling observer who is approaching the inner 
horizon given by r = r-,v = oo. Denoting the observer’s constant spe- 
cific energy parameter (see e.g. [18]) by if = —U^, where the Killing vector 
^ = d/dv, observer’s four-velocity U = d/dr (r - observer’s proper time), 
the geodesic equations imply 

f^ + f = Efi V = f-\E - - f)hl (13) 

where a dot denotes d/dr. Between the horizons, ^ is spacelike and E can 

be negative. Geodesic equations (13) imply dr/dv = for an observer with 

E < 0, approaching r = r- from region II. Expanding / near r = r-, 

f = /'(r_)(r — r_) = — 2«:(r — r_), where 

K = - Q‘^)i /rf (14) 

is the surface gravity of the inner horizon (as it follows from definition (8)), 
and integrating, we get / near r_. From the second equation in (13) we then 
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obtain the “asymptotic formula” 

V ~ constant | E \ e”” = ^ , v ^ oo, At — >• 0, (15) 

where At is the amount of proper time the observer needs to reach the inner 
(Cauchy) horizon. 

Imagine now two nearby events Bgut in the outside world /, for 

example the emission of two photons from a given fixed r > r+, which are 
connected by the ingoing null geodesics v = constant and v + dv = constant 
with events and Bin on the worldline of the observer approaching r_ . The 
interval of proper time between Agut and Bout is dTout ^ dv, whereas (15) 
implies that the interval of proper time between Ain and Bin, as measured 
by the observer approaching r_, is dTin ^ e~'^'"dv. Therefore, 



di"' 

dTout 



e 



— KV 



(16) 



so that as V ^ +oo the events (clocks) in the outside world are measured to 
proceed increasingly fast by the inside observer approaching the inner hori- 
zon. In the limit, when the observer crosses the Cauchy horizon, he sees the 
whole future history (from some event as Agut) of the external universe to 
“proceed in one flash”: dTin 0 with dTout oo- An intuitive explanation 
is given by the fact that the observers in region I need infinite proper time 
to reach v = oo, whereas inside observers only finite proper time. The in- 
falling radiation will thus be unboundedly blue-shifted at the inner horizon 
which in general will lead to a divergence of the energy density there. This 
infinite blueshift at the inner (Cauchy) horizon makes it generally unstable 
to perturbations (“the blue-sheet instability”). 

There exists an extensive literature analyzing the exciting questions of 
the black hole interiors (see for example [76], the introductory review [103] 
in the proceedings of the recent workshop devoted entirely to these issues, 
and other contributions in the proceedings which give also many further 
references). Some crucial questions are still the subject of much debate. One 
of the following two approaches to the problem is usually chosen: (i) a linear 
perturbation analysis of the behaviour of fields at the Cauchy horizon, (ii) 
the simplified nonlinear, spherically symmetric models of black hole interiors. 

In the first approach one considers the evolution of linear perturbations, 
representing scalar, electromagnetic, or gravitational fields, on the Reissner- 
Nordstrom background. Since there is a nonvanishing background electric 
field, the electromagnetic and gravitational perturbations are coupled.^^ It is 

This leads to various interesting phenomena. For example, the scattering of inci- 
dent electromagnetic and gravitational waves by the Reissner-Nordstrom black 
hole allows for the partial conversion of electromagnetic waves into gravitational 
waves and vice versa [91]. When studying stationary electromagnetic fields due 
to sources located outside the Reissner-Nordstrom black hole, one discovers that 
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a remarkable fact that “wave equations” for certain gauge-invariant combi- 
nations of perturbations can be derived from which all perturbations can 
eventually be constructed [91,105,106]. In the simplest case of the scalar field 
<P on the Reissner-Nordstrom background, after resolving the field into spher- 
ical harmonics and putting <I> = r^, the wave equation has the form [107] 

'Z'.tt-tf'.r-r* +F/(r*)f = 0, (17) 



where the curvature-induced potential barrier is given by 



Fiin 






i{i+i) 

J.2 



(18) 



where r is considered to be a function of r* (cf. (12)). In order to determine 
the evolution of the field below the outer horizon in a real gravitational col- 
lapse, one first concentrates on the evolution of the field outside a collapsing 
body (star). A nonspherically symmetric scalar test field (generated by a non- 
spherical distribution of “scalar charge” in the star) serves as a prototype for 
(small) asymmetries in the external gravitational and electromagnetic fields, 
which are generated by asymmetries in matter and charge distributions inside 
the star. Now when a slightly nonspherical star starts to collapse, the per- 
turbations become dynamical and propagate as waves. Their evolution can 
be determined by solving the wave equation (17). Because of the potential 
barrier (18) the waves get backscattered and produce slowly decaying radia- 
tive tails, as shown in the classical papers by Price [108,109], and generalized 
to the Reissner-Nordstrom case in [107,110]. The tails decay in the vicinity 
of the outer event horizon r_|_ (i.e. between regions I and II in Fig. 5) as 
I' ^ t;“2(i+i) fQj. /_pole perturbations.^^ The decaying tails provide the initial 
data for the “internal problem” - the behaviour of the field near the Cauchy 
horizon. Calculations show (see [103] and references therein) that near the 
Cauchy horizon the behaviour of the field remains qualitatively the same: 
iF(m, u— >-oo) ^ _|_ |3iowly varying function of m}, where u = 2r* — v 

is constant along outgoing radial null geodesics in region II. However, as a 
consequence of the “exponentially growing blueshift”, given by formula (15), 
the rate of change of the field diverges as the observer approaches the Cauchy 
horizon'. d'P / dr = (dW/dx°‘)U°‘ ~ u ^ Therefore, the mea- 

sured energy density in the field would also diverge, causing an instability of 
the Cauchy horizon, which would be expected to create a curvature singular- 
ity. More detailed considerations [103] show that the singularity, at least for 

closed magnetic field lines not linking any current source may exist, since gravita- 
tional perturbations constitute, via the background Maxwell tensor, an effective 
source [104]. 

This result is true for a general charged Reissner-Nordstrom black hole with 
< M^. In the extremal case, = M^, the field decays only as ~ 

[107]. 
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large |it|, is null and weak (the metric is well-defined, only the Riemann ten- 
sor is singular). Any definitive picture of the Cauchy horizon instability can 
come however only from a fully nonlinear analysis which takes into account 
the backreaction of spacetime geometry to the growing perturbations. 

The second approach to the study of the Cauchy horizon instabilities em- 
ploys a simplified, spherically symmetric model which treats the nonlineari- 
ties exactly [111]. The ingoing radiation is modelled by a stream of ingoing 
charged null dust [112] which is infinitely blueshifted at the inner horizon. 
There is, however, also an outgoing stream of charged null dust considered 
to propagate into region II towards the inner horizon. The outgoing flux may 
represent radiation coming from the stellar surface below the outer horizon, 
as well as a portion of the ingoing radiation which is backscattered in region 
II, and irradiates thus the inner horizon. A detailed analysis based on exact 
spherically symmetric solutions revealed a remarkable effect: an effective in- 
ternal gravitational-mass parameter of the hole unboundedly increases at the 
inner (Cauchy) horizon (though the external mass of the hole remains finite). 
This “mass inflation phenomenon” causes the divergence of some curvature 
scalars at the Cauchy horizon [111]. In reality, the classical laws of general 
relativity will break down when the curvature reaches Planckian values. 

It is outside the scope of this review to discuss further the fascinating 
issues of black hole interiors. They involve deep questions of classical rela- 
tivity, of quantum field theory on curved background (as, for example, in 
discussions of electromagnetic pair production and vacuum polarization ef- 
fects inside black holes), and they lead us eventually to quantum gravity. We 
refer again especially to [76] and [103] for more information. Let us only add 
three further remarks. We mentioned above the work on the inner structure 
of Reissner-Nordstrom black holes because this is the most explored (though 
not closed) area. However, Kerr black holes (Sect. 4) possess also inner hori- 
zons and there are many papers concerned with the instabilities of the Kerr 
Cauchy horizons (see [76,103] for references). Secondly, at the beginning of 
1990s, it was shown that the inner horizons of the Reissner-Nordstrom-de 
Sitter and Kerr-de Sitter black holes are classically stable in the case when 
the surface gravity at the inner horizons is smaller than the surface gravity 
at the cosmological horizon ([103] and references therein, in particular, the 
review [113]). Penrose [114] even suggested that “it may well be that cosmic 
censorship requires a zero (or at least a nonpositive) cosmological constant” . 
Very recently, however, three experts in the field [115] have claimed that out- 
going modes near to the black hole (outer) event horizon lead to instability 
for all values of the parameters of Reissner-Nordstrom-de Sitter black holes. 
Let me borrow again a statement from Penrose [114]: “My own feelings are 
left somewhat uncertain by all these considerations” . 

Finally, a new contribution [116] to the old problem of testing the weak 
cosmic censorship by employing a Reissner-Nordstrom black hole indicates 
that one can overcharge a near extreme — >■ M^) black hole by throwing in 
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a charged particle appropriately. However, the backreaction effects remain to 
be explored more thoroughly. The question of cosmic censorship thus remains 
as interesting as ever. 

3.2 On Extreme Black Holes, d-Dimensional Black Holes, 

String Theory and “All That” 

In the previous section we considered generic Reissner-Nordstrom black holes 
with > Q^. They have outer and inner horizons given by (11), with 
nonvanishing surface gravities (cf. (14) for the inner horizon). For = 
the two horizons coincide at r+ = r_ = M. Defining the ingoing null 
coordinate v as in (12), we obtain the ingoing extension of the Reissner- 
Nordstrom metric (9) in the form 



This is the metric of extreme Reissner-Nordstrom black holes. Frequently, 
these holes are called “degenerate” . At the horizon r = M, the Killing vector 
field k = d/dv obeys the equation {k°‘ka ),(3 = 0, so that regarding the general 
relation (8), the surface gravity k = 0, i.e. the Killing horizon is degenerate. 
Using {k‘^),fs = 0 and the Killing equation, we easily deduce that the horizon 
null generators with tangent A:“ = dx°‘/dv satisfy the geodesic equation with 
affine parameter v. The generators have infinite affine length to the past given 
by u — > — oo (in contrast to the generators of a bifurcate Killing horizon - cf. 
Sect. 2.4). This part of the extreme Reissner-Nordstrom spacetime, given by 
r = M, V — ^ — oo, is called an “internal infinity”. That there is no “wormhole” 
joining two asymptotically flat regions and containing a minimal surface 2- 
sphere like in the non-extreme case can also be seen from the metric in the 
original Schwarzschild-type coordinates. Considering an embedding diagram 
t = constant, 9 = tt/ 2 in flat Euclidean space one finds that an infinite “tube”, 
or an asymptotically cylindrical region on each t = constant hypersurface 
develops. The boundary of the cylindrical region is the internal infinity. It is 
a compact 2-dimensional spacelike surface. The hypersurfaces t = constant 
do not intersect the horizon but only approach such an intersection at the 
internal infinity. (See [79] for the conformal diagram and a detailed discussion, 
including analysis of the electrovacuum Robinson-Bertotti universe as the 
asymptotic limit of the extreme Reissner-Nordstrom geometry at the internal 
infinity. ) 

There has been much interest in the extreme Reissner-Nordstrom black 
holes within standard Einstein-Maxwell theory. They admit surprisingly sim- 
ple solutions of the perturbation equations [117]. Some of them appear to be 
stable with respect to both classical and quantum processes, and there are 
attempts to interpret them as solitons [118]. Also, they admit supersymmetry 



ds^ = — dr'^ + 2dvdr + r^{d9^ + sin^ 6dif'^). (19) 




[ 119 ]. 
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The quotation marks in the title of this section play a double role: the 
last two words are just “quoting” from the end of the title of a general re- 
view on string theory and supersymmetry prepared for the special March 
1999 issue of the Reviews of Modern Physics in honor of the centenary of 
the American Physical Society by Schwarz and Seiberg [120], but they also 
should “self-ironically” indicate my ignorance in these issues. In addition, 
unified theories of the type of string theory appear to be somewhat outside 
the direct interest of Jurgen Ehlers, who has always emphasized the depth 
and economy of general relativity because it is a “background-independent” 
theory: string theories still suffer from the lack of a background-independent 
formulation. Nevertheless, they are beautiful, consistent, and very challeng- 
ing constructions, representing one of the most active areas of theoretical 
physics. Recently, string theory provided an explanation of the Bekenstein- 
Hawking prediction of the entropy of extreme and nearly extreme black holes. 
From the point of view of this review we should emphasize that many of the 
techniques that have been used to obtain exact solutions - mostly exact black 
hole solutions - in generalized theories like string theory were motivated by 
classical general relativity. There are also results in classical general relativity 
which are finding interesting generalizations to string theories, as we shall see 
with one example below. 

Before making a few amateurish comments on new results concerning 
extreme black holes in string theories, let us point out that in many pa- 
pers from the last 20 years, black hole solutions were studied in spacetimes 
with the number of dimensions either lower or higher than four. The lower- 
dimensional cases are usually analyzed as “toy models” for understanding the 
complicated problems of quantum gravity. The higher-dimensional models 
are motivated by efforts to find a theory which unifies gravity with the other 
forces. The most surprising and popular (2-|-l)-dimensional black hole is the 
BTZ (Bahados-Teitelboim-Zanelli) black hole in the Einstein theory with a 
negative cosmological constant. Locally it is isometric to anti de Sitter space 
but its topology is different. In [121] the properties of (2-|-l)-dimensional 
black holes are reviewed. In (l-l-l)-dimensions one obtains black holes only if 
one includes at least a simple dilaton scalar field; the motivation for how to 
do this comes from string theory. In higher dimensions one can find general- 
izations of all basic black hole solutions in four dimensions [122]. Interesting 
observations concerning higher-dimensional black holes have been given a 
few years ago [123]. Perhaps one does not need to quantize gravity in order 
to remove the singularities of classical relativity. It may well be true that 
some new classical physics intervenes below Planckian energies. In [123] it is 
demonstrated that certain singularities of the four-dimensional extreme dila- 
ton black holes can be resolved by passing to a higher-dimensional theory of 
gravity in which usual spacetime is obtained only below some compactifica- 
tion scale. A useful, brief pedagogical introduction to black holes in unified 
theories is contained in [76]. 
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One of the most admirable recent results of string theory, which undoubt- 
edly converted some relativists and stimulated many string theorists, has 
been the derivation of the exact value of the entropy of extreme and nearly 
extreme black holes. I shall just paraphrase a few statements from the March 
1999 review for the centenary of the American Physical Society by Horowitz 
and Teukolsky [124]. There are very special states in string theory called 
BPS (Bogomorny-Prasad-Sommerfield) states which saturate an equality 
M > c\Q\, with M being the mass, Q the charge, and c is a fixed constant. 
The mass of these special states does not get any quantum corrections. The 
strength of the interactions in string theory is determined by a coupling con- 
stant g. One can count BPS states at large Q and small g. By increasing g one 
increases gravity, and then all of these states become black holes. (The BPS 
states are supersymmetric and one can thus follow the states from weak to 
strong coupling.) But they all become identical extreme Reissner-Nordstrom 
black holes, because there is only one black hole for given M = \Q\. When 
one counts the number N of BPS states in which an extreme hole can exist, 
and compares this with the entropy Sbk = of the hole as obtained in 
black hole thermodynamics [82,125], where A is the area of the event horizon 
{A = for the extreme Reissner-Nordstrom black hole), one finds ex- 

actly the “classical” result: Sbh = log TV!. The entropy of the classical black 
hole configuration is given in terms of the number of quantum microstates 
associated with that configuration, by the basic formula of statistical physics. 
For more detailed recent reviews, see [126,127], and references therein. Re- 
markably, the results for the black hole entropy have been obtained also 
within the canonical quantization of gravity [128]. A comprehensive review 
[129] of black holes and solitons in string theory appeared very recently. 

Allow me to finish this “all that” section with a personal remark. In 1980 
L. Dvorak and I found that in the Einstein-Maxwell theory, external mag- 
netic flux lines are expelled from the black hole horizon as the hole becomes 
an extreme Reissner-Nordstrom black hole [104]. Hence, extreme black holes 
exhibit some sort of “Meissner effect” known from superconductivity. Last 
year it was demonstrated by Chamblin, Emparan and Gibbons [130] that this 
effect occurs also for black hole solutions in string theory and Kaluza-Klein 
theory. Other extremal solitonic objects in string theory (like p-branes) can 
also have superconducting properties. Within the Einstein-Maxwell theory 
this effect was first studied to linear order in magnetic field - we analyzed 
Reissner-Nordstrom black holes in the presence of magnetic fields induced 
by current loops. However, we also used an exact solution due to Ernst [131], 
describing a charged black hole in a background magnetic field, which asymp- 
totically goes over to a Melvin universe, and found the same effect (see also 
[132] for the case of the magnetized Kerr-Newman black hole). In [130] the 
techniques of finding exact solutions of Einstein’s field equations are employed 
within string theory and Kaluza-Klein theory to demonstrate the “Meissner 
effect” in these theories. 
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4 The Kerr Metric 

The discovery of the Kerr metric in 1963 and the proof of its unique role in 
the physics of black holes have made an immense impact on the development 
of general relativity and astrophysics. This can hardly be more eloquently 
demonstrated than by an emotional text from Chandrasekhar [60]: “In my 
entire scientific life, extending over forty-five years, the most shattering expe- 
rience has been the realization that an exact solution of Einstein’s equations 
of general relativity, discovered by the New Zealand mathematician Roy Kerr, 
provides the absolutely exact representation of untold numbers of massive 
black holes that populate the Universe...” 

In Boyer-Lindquist coordinates the Kerr metric [133] looks as follows (see 
e.g. [18,30]): 



where 



ds^ = - 1- 
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+ ^dr"^ + SdO'^ + ^sin^0d(/3^, 
Z\ E 



( 20 ) 



E = + a?' cos^ 0, A = — 2Mr + a?', 

A = E{r^ + a?) + 2Mra? 9, (21) 

where M and a are constants. 

4.1 Basic Features 

The Boyer-Lindquist coordinates follow naturally from the symmetries of the 
Kerr spacetime. The scalars t and (p can be fixed uniquely (up to additive 
constants) as parameters varying along the integral curves of (unique) sta- 
tionary and axial Killing vector fields k and m; and the scalars r and 9 can 
be fixed (up to constant factors) as parameters related as closely as possi- 
ble to the (geometrically preferred) principal null congruences, which in the 
Kerr spacetime exist (see e.g. [18,30]), and their projections on to the two- 
dimensional spacelike submanifolds orthogonal to both k and m (see [134] 
for details). The Boyer-Lindquist coordinates represent the natural general- 
ization of Schwarzschild coordinates. With a = 0 the metric (20) reduces to 
the Schwarzschild metric. 

By examining the Kerr metric in the asymptotic region r — >■ oo, one finds 
that M represents the mass and J = M a the angular momentum pointing 
in the z-direction, so that a is the angular momentum per unit mass. One 
can arrive at these results by considering, for example, the weak field and 
slow motion limit, M/r <C 1 and a/r <C 1. The Kerr metric (20) can then be 
written in the form 
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+r^ {dO^ + sin^ 9 dip^) 
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sin^ 6 dp dt, 



( 22 ) 



which is the weak field metric generated by a central body with mass M 
and angular momentum J = Ma. A general, rigorous way of interpreting the 
parameters entering the Kerr metric starts from the definition of multipole 
moments of asymptotically fiat, stationary vacuum spacetimes. This is given 
in physical space by Thorne [135], using his “asymptotically Cartesian and 
mass centered” coordinate systems, and by Hansen [136], who, generalizing 
the definition of Geroch for the static case, gives the coordinate independent 
definition based on the conformal completion of the 3-dimensional manifold 
of trajectories of a timelike Killing vector k. The exact Kerr solution has 
served as a convenient “test-bed” for such definitions.^^ The mass monopole 
moment ~ the mass - is M, the mass dipole moment vanishes in the “mass- 
centered” coordinates, the quadrupole moment components are ^Maf and 
— The current dipole moment - the angular momentum - is nonvan- 

ishing only along the axis of symmetry and is equal to J = Ma, while the 
current quadrupole moment vanishes. All other nonvanishing /-pole moments 
are proportional to Ma} [135,136]. Because these specific values of the mul- 
tipole moments depend on only two parameters, the Kerr solution clearly 
cannot represent the gravitational field outside a general rotating body. In 
Sect. 6.2 we indicate how the Kerr metric with general values of M and a 
can be produced by special disk sources. The fundamental significance of the 
Kerr spacetime, however, lies in its role as the only vacuum rotating black 
hole solution. 

Many texts give excellent and thorough discussions of properties of Kerr 
black holes from various viewpoints [18,19,26,30,70,75,76,79,91,95,138]. The 
Kerr metric entered the new edition of “Landau and Lifshitz” [139]. A few 
years ago, a book devoted entirely to the Kerr geometry appeared [140]. Here 
we can list only a few basic points. 

As with the Reissner-Nordstrom spacetime, one can make the maximal 
analytic extension of the Kerr geometry. This, in fact, has much in common 
with the Reissner-Nordstrom case. Loosely speaking, the “repulsive” char- 
acters of both charge and rotation have somewhat similar manifestations. 
When < M^, the metric (20) has coordinate singularities at A = 0, i.e. at 
(cf. (21)) 

r = r± = M (23) 

For the most complete, rigorous treatment of the asymptotic structure of station- 
ary spacetimes characterized uniquely by multipole moments defined at spatial 
infinity, see the work by Beig and Simon [137], the article by Beig and Schmidt 
in this volume, and references therein. 
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Fig. 6. A schematic picture of a Kerr black hole with two horizons, ergosphere, and 
the ring singularity; local light wavefronts are also indicated. Particle A, entering 
the ergosphere from infinity, can split inside the ergosphere into particles B and C 
in such a manner that C arrives at infinity with a higher energy than particle A 
came in. 



The intrinsic three-dimensional geometry at r = reveals that these are 
null hypersurfaces - the (outer) event horizon and the inner horizon (Fig. 
6). As with the Reissner-Nordstrom metric, the inner horizon - the Cauchy 
hypersurface - is unstable (see a more detailed discussion in Sect. 3.2 and 
references there). And as in the Penrose diagram in Fig. 5, one finds infinitely 
many asymptotically flat regions in the analogous Penrose diagram for the 
Kerr black hole spacetime. 

A crucial difference between the Reissner-Nordstrom and Kerr geome- 
try is the existence of the ergosphere (or, more precisely, ergoregion) in the 
Kerr case. This is caused by the dragging of inertial frames due to a non- 
vanishing angular momentum. The timelike Killing vector k, given in the 
Boyer-Lindquist coordinates by d/dt, becomes null “sooner”, at r = ro, than 
at the event horizon, tq > r_|_ at 0 0, tt, as a consequence of this dragging: 

= 1 - 2Mr/S = 0, 

r = ro = M + {M^ — of cos^ ff) = . (24) 

This is the location of the ergosurface, the ergoregion being between this sur- 
face and the (outer) horizon. In the ergosphere, schematically illustrated in 
Fig. 6, the “rotating geometry” drags the particles and light (with wavefronts 
indicated in the figure) so strongly that all physical observers must corotate 
with the hole, and so rotate with respect to distant observers - “fixed stars” - 
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at rest in the Boyer-Lindquist coordinates^® Static observers, whose world- 
lines (r, 0, ip) = constant would have k as tangent vectors, cannot exist since 
k is spacelike in the ergosphere. Indeed, a non-spacelike worldline with r, 9 
fixed must satisfy the condition 

gttdt^ + g^ipdif'^ + 2g^tdifdt < 0, (25) 



in which gtt = —k°‘ka, g^^^p = m“mQ, gp,t = k°^ma are invariants. In the 
ergosphere, the metric (20), (21) yields gtt > 0, g^p^ > < 0, so that 

dif/dt > 0 - an observer moving along a non-spacelike worldline must corotate 
with the hole. The effect of dragging on the forms of photon escape cones 
in a general Kerr field (without restriction < M^) has been numerically 
studied and carefully illustrated in a number of figures only recently [142] . 

In order to “compensate” the dragging, the congruence of “locally nonro- 
tating frames” (LNRFs), or “zero-angular momentum observers” (ZAMOS), 
has been introduced. These frames have also commonly been used outside 
relativistic, rapidly rotating stars constructed numerically, but the Kerr met- 
ric played an inspiring role (as, after all, in several other issues, such as in 
understanding the ergoregions, etc.). The four- velocity of these (not freely 
falling!) observers, given in Boyer-Lindquist coordinates by 



e 



(i) - 



{A/SA)^,0,0,2aMr/{ASA)^ , 



(26) 



is orthogonal to the hypersurfaces t = constant. The particles falling from 
rest at infinity with zero total angular momentum fall exactly in the radial 
direction in the locally nonrotating frames with an orthogonal triad tied to 
the r,9,if coordinate directions (see [143] for the study of the shell of such 
particles falling on to a Kerr black hole) . 

Now going down from the ergosphere to the outer horizon, we find that 
both Killing vectors k and m are tangent to the horizon, and are spacelike 
there (with k “rotating” with respect to infinity). The null geodesic gen- 
erators of the horizon are tangent to the null vectors I = k -\- Qm, where 
17 = a/2Mr^ = constant is called the angular velocity of the hole. 17 is con- 
stant over the horizon so that the horizon rotates rigidly. Since i is a Killing 
vector, the horizon is a Killing horizon (cf. Sect. 2.4). 

Another notable difference from the Reissner-Nordstrom metric is the 
character of the singularity at A = 0, i.e. at r = 0, 9 = tt/ 2. It is timelike, 
as in the Reissner-Nordstrom case, but it is a ring singularity (see Fig. 6). 
In the maximal analytic extension of the Kerr metric one can go through 
the ring to negative values of the coordinate r and discover closed timelike 

It is instructive to analyze the somewhat “inverse problem” of gravitational col- 
lapse of a slowly rotating dust shell which produces the Kerr metric, linearized 
in a/M, outside (cf. Eq.(22)), and has flat space inside. Fixed distant stars seen 
from the centre of such shell appear to rotate due to the dragging of inertial 
frames, as was discussed in detail recently [141]. 
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lines since < 0 there. If the Kerr parameters are such that a? > the 
Kerr metric does not represent a black hole. It describes the gravitational 
field with a naked ring singularity. The Kerr ring singularity has a repulsive 
character near the rotation axis. It gives particles outward accelerations and 
collimates them along the rotation axis [144], which might be relevant in 
the context of the formation and precollimation of cosmic jets. However, the 
cosmic censorship conjecture is a very plausible, though difficult “to prove” 
hypothesis, and Kerr naked singularities are unlikely to form in nature. How- 
ever, the Kerr geometry with > M^, with a region containing the ring 
singularity “cut out”, can be produced by thin disks; though if they should 
be composed of physical matter, they cannot be very relativistic (see Sect. 
6.2 and references therein). 

If the Kerr solution represents an extreme Kerr black hole, as is 

the analogous Reissner-Nordstrom case with = M^. The inner and outer 
horizons then coincide at r = M . The horizon is degenerate with infinite affine 
length. Almost extreme Kerr black holes probably play the most important 
role in astrophysics (see below) . In realistic astrophysical situations accreting 
matter will very likely have a sufficient amount of angular momentum to turn 
a Kerr hole to an almost extreme state. 

There exists a charged, electrovacuum generalization of the Kerr fam- 
ily found by Newman et al. [145]. The Kerr-Newman metric in the Boyer- 
Lindquist coordinates can be obtained from the Kerr metric (20) if all of 
the terms 2Mr explicitly appearing in (20), (21) are replaced by 2Mr — , 

with Q being the charge. The metric describes charged, rotating black holes 
if + Q^, with two horizons located at = M ± (M^ — a? — Q^) i . 

These become extreme when + Q'^, and with one 

obtains naked (ring) singularities. The analytic extension, the presence of 
ergoregions and the structure of the singularity is similar to the Kerr case. 

In addition to the gravitational field, there exists a stationary electro- 
magnetic field which is completely determined by the charge Q and rotation 
parameter a. The vector potential of this field is given by the 1-form 

Aadx'^ = — {Qr/E){dt — a sin^ 9 dip), (27) 

so that if a yf 0 the electric field is supplemented by a magnetic field. At large 
distances (r — ^ oo) the field corresponds to a monopole electric field with 
charge Q and a dipole magnetic field with magnetic moment p = Qa. Since 
the gyromagnetic ratio of a charged system with angular momentum J is 
defined by 7 = p/ J, one finds the charged-rotating-black hole gyromagnetic 
ratio to satisfy the relation 7 = Q/M, i.e. it is twice as large as that of 
classical matter, and the same as that of an electron. By examining a black 
hole with a loop of rotating charged matter around it, the radius of the loop 
changing from large values to the size of the horizon, it is possible to gain 
some understanding of this result [146]. 



The Role of Exact Solutions 



47 



4.2 The Physics and Astrophysics Around Rotating Black Holes 

In the introduction to their new 770 page monograph on black hole physics, 
Frolov and Novikov [76] write: there are a lot of questions connected 

with black hole physics and its applications. It is now virtually impossible to 
write a book where all these problems and questions are discussed in detail. 
Every month new issues of Physical Review D, Astrophysical Journal, and 
other physical and astrophysical journals add scores of new publications on 
the subject of black holes...” Although Frolov and Novikov have also black 
hole-like solutions in superstring and other theories on their minds, we would 
not probably be much in error, in particular in the context of astrophysics, 
if we would claim the same just about Kerr black holes. Hence, first of all, 
we must refer to the same literature as in the previous Sect. 4.1. A few more 
references will be given below. 

A remarkable fact which stands at the roots of these developments is that 
the wave equation is separable, and the geodesic equations are integrable in 
the Kerr geometry. Carter [79], who explicitly demonstrated the separability 
of the Hamilton- Jacobi equation governing the geodesic motion, has empha- 
sized that one can in fact derive the Kerr metric as the simplest nonstatic 
generalization of the Schwarzschild solution, by requiring the separability of 
the covariant Klein-Gordon wave equation.^® 

A thorough and comprehensive analysis of the behaviour of freely falling 
particles in the Kerr field would produce material for a book. We refer to e.g. 
[18,76,91,134,138,144,147] for fairly detailed accounts and a number of further 
references. From the point of view of astrophysical applications the following 
items appear to be most essential: in contrast to the Schwarzschild case, 
where the stable circular orbits exist only up to r = 3r+ = 6M, in the field of 
rotating black holes, the stable direct (i.e. with a positive angular momentum) 
circular orbits in the equatorial plane can reach regions of “deeper potential 
well”. With an extreme Kerr black hole the last stable direct circular orbit 
occurs at r = r+ = M. (See [147] for a clear discussion of the positions of the 
innermost stable, innermost bound, and photon orbits as the hole becomes 
extreme and a long cylindrical throat at the horizon develops.) A “spin- orbit- 
coupling” effect increases the binding energy of the direct orbits and decreases 
the binding energy of the retrograde (with a negative angular momentum) 
orbits relative to the Schwarzschild values. The binding energy of the last 
stable direct circular orbit is AE = 0.0572/i (/i is the particle’s proper mass) 
in the Schwarzschild case, whereas AE = 0.4235/r for an extreme Kerr hole. 
A particle slowly spiralling inward due the emission of gravitational waves 

Although this is still not a “constructive, analytic derivation of the Kerr metric 
which would fit its physical meaning” , as required by Landau and Lifshitz [139] , 
it is certainly more intuitive than the original derivation by Kerr. On the other 
hand, despite various hints like the existence of the Killing tensor field (in addition 
to the Killing vectors) in the Kerr geometry (see e.g. [134]), it does not seem to 
be clear why the Kerr geometry makes it possible to separate these equations. 
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would radiate the total energy equal to this binding energy; hence much more 
- 42% of its rest energy - in the Kerr case. The second significant effect is 
the dragging of the particles moving on orbits outside of the equatorial plane. 
The dragging^"^ will make the orbit of a star around a supermassive black hole 
to process with angular velocity ^ 2J/r^. The star may go through a disk 
around the hole, subsequently crossing it at different places [150]. One can 
also show that as a result of the joint action of the gravomagnetic effect and 
the viscous forces in an accretion disk, the disk tends to be oriented in the 
equatorial plane of the central rotating black hole (the “Bardeen-Petterson 
effect” ) . 

The above examples demonstrate specific effects in the Kerr background 
which very likely play a significant role in astrophysics (see also Sect. 4.3 
below) . The best known process in the field of a rotating black hole is probably 
astrophy sic ally unimportant, but is of principal significance in the black hole 
physics. This is the Penrose process for extracting energy from rotating black 
holes. It is illustrated schematically in Fig. 6: particle A comes from infinity 
into the ergosphere, splits into two particles, B and C. Whereas C is ejected 
back to infinity, B falls inside the black hole. The process can be arranged 
in such a way that particle C comes back to infinity with higher energy than 
with which particle A was coming in. The gain in the energy is caused by the 
decrease of rotational energy of the hole. Such process is possible because the 
Killing vector k becomes spacelike in the ergosphere, so that the (conserved) 
energy of particle B “as measured at infinity” (see e.g. [18]), Eb = —kapf^, 
can be negative. Unfortunately, the “explosion” of particle B requires such a 
big internal energy that the process is not realistic astrophy sic ally. 

More general considerations of the interaction of black holes with matter 
outside have led to the formulation of the four laws of black hole thermody- 
namics [19,76,82,125]. These issues, in particular after the discovery of the 
Hawking effect that black holes emit particles thermally with temperature 
T = Kh/2-Kkc (re-surface gravity, fc-Boltzmann’s constant), have been an in- 
spiration in various areas of theoretical physics, going from general relativity 
and statistical physics, to quantum gravity and string theory (see [125,126] 
and some remarks and references in Sect. 3.2). The Kerr solution played 
indeed the most crucial role in these developments. I recall how during my 
visits to Moscow in the middle of the 1970s Zel’dovich and his colleagues were 
somewhat regretfully admitting that they were on the edge of discovering the 
Hawking effect. They realized that an analogue of the Penrose process occurs 
with the waves (in so called superradiant scattering) which get amplified if 
their energy per unit angular momentum is smaller than the angular velocity 

Relativists often consider the effects produced by moving mass currents as “the 
dragging of inertial frames” , but the concept of the gravomagnetic field, or gravo- 
magnetism has some advantages, as has been stressed recently [148]. The gravo- 
magnetic viewpoint, however, has also been used in many works in the past - 
see, e.g. [71,75], and in particular [149] and references therein. 
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f? of the horizon. Zel’dovich then suggested that there should be spontaneous 
emission of particles in the corresponding modes but did not study quantum 
fields on a nonrotating background (cf. an account of these developments, 
including the visit of Hawking to Moscow in 1973, by Israel [70]). 

Returning back to Earth or, rather, up to heavens, it is not so well known 
that an astrophysically more realistic example of the Penrose process exists: 
this is the Blandford-Znajek mechanism - see [75,88,151] - in which a mag- 
netic field threading a rotating hole (the field being maintained by external 
currents in an accretion disk, for example) can extract the hole’s rotational 
energy and convert it into a Poynting flux and electron-positron pairs. A 
Kerr black hole with angular momentum parallel to an external magnetic 
field acts (by “unipolar induction”) like a rotating conductor in an external 
field. There will be an induced electric field and a potential difference be- 
tween the pole and the equator. If these are connected, an electric current 
will flow and power will be dissipated. In fact, this appears to be until now 
the most plausible process to explain gigantic relativistic jets emanating from 
the centres of some of the most active galaxies. The HZ-mechanism has its 
problems: extremely rotating black holes expel magnetic flux [75,152] - there 
probably exists a value of the angular momentum Jo < Jmax for which the 
power extracted will be greatest. It is not clear whether the process can be ef- 
ficiently maintained [153]; and perhaps more importantly, new astrophysical 
estimates of seed magnetic fields seem to be too low to make the mechanism 
efficient [154]. The HZ-mechanism will probably attract more attention in 
the coming years, in particular in view of the recent discovery of two “mi- 
croquasars” in our own Galaxy, which generate double radio structures and 
apparent superluminal jets similar to extragalactic strong radio sources [155]. 

A remarkable achievement of pure mathematical physics, with a great 
impact on astrophysics, has not only been the discovery of the Kerr solution 
itself but also the development of the theory of Kerr metric perturbations 
[75,76,91,156]. By employing the Newman-Penrose null tetrad formalism, 
invented and extensively used in mathematical relativity, in particular in 
gravitational radiation theory, it has been possible to separate completely 
all perturbation equations for non-zero spin fields. In particular, a single 
“master equation” ~ called the Teukolsky equation - governs scalar, electro- 
magnetic and gravitational perturbations of a Kerr black hole.^® If no sources 
are present on the right-hand side, the equation looks as follows: 
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In the case of a Kerr-Newman black hole, the electromagnetic and gravitational 
perturbations necessarily couple. Until now, in contrast to the spherical Reissner- 
Nordstrom case, a way of how to decouple them has not been discovered. 
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The coordinates are the Boyer-Lindquist coordinates used in (20), A is de- 
fined in (21), and s is the spin weight of the perturbing field; s = 0,±1,±2. 
The variables in the Teukolsky equation can be separated by decomposing ij) 
according to 



where sSim are so called spin weighted spheroidal harmonics. By solving the 
radial Teukolsky equation for sRim with appropriate boundary conditions one 
can find answers to a number of (astro) physical problems of interest like the 
structure of stationary electromagnetic or gravitational fields due to external 
sources around a Kerr black hole (e.g. [75,157]), the emission of gravitational 
waves from particles plunging into the hole (e.g. [76,97,158]), or the scattering 
of the waves from a rotating black hole (e.g. [76,156] and references therein). 
At present, the Teukolsky equation is being used to study the formation of a 
rotating black hole from a head-on collision of two holes of equal mass and 
spin, initially with small separation, to find the wave forms of gravitational 
radiation produced in this process [159]. The first studies of second-order 
perturbations of a Kerr black hole are also appearing [160]. 

To find all gravitational (metric) perturbations by solving the complete 
system of equations in the Newman-Penrose formalism is in general a formi- 
dable task. As Chandrasekhar’s “last observation” at the end of his chapter on 
gravitational perturbations of the Kerr black hole reads [91]: “The treatment 
of the perturbations of the Kerr spacetime in this chapter has been prolixious 
in its complexity. Perhaps, at a later time, the complexity will be unravelled 
by deeper insights. But mean time, the analysis has led us into a realm of 
the rococo: splendorous, joyful, and immensely ornate.” 

4.3 Astrophysical Evidence for a Kerr Metric 

Very recently new observations seem to have opened up real possibilities of 
testing gravity in the strong-field regime. In particular, it appears feasible 
to distinguish the Kerr metric from the Schwarzschild, i.e. to measure a/M . 
Our following remarks on these developments are based on the review by M. 
Rees [88], and in particular, on the very recent survey by A. Fabian [161], 
an authority on diagnosing relativistic rotation from the character of the 
emission lines of accretion disks around black holes. 

The interest here is not in optical lines since the optical band comes from 
a volume much larger than the hole. However, the X-rays are produced in 
the innermost parts of an accretion flow, and should thus display substantial 
gravitational redshifts as well as Doppler shifts. This only became possible 
to observe quite recently, when the ASCA X-ray satellite started to operate. 
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and the energy resolution and sensitivity became sufficient to analyze line 
shapes. 

Typically the profile of a line emitted by a disk from gas orbiting around 
a compact object has a double-horned shape. The disk can be imagined to 
be composed of thin annuli of orbiting matter - the total line is then the 
sum of contributions from each annulus. If the disk is not perpendicular to 
our line of sight its approaching sides will - due to classical Doppler shifts - 
produce blue peaks, receding sides red peaks. The broadest parts of the total 
line come from the innermost annuli because the motion there is fastest. In 
addition, there are relativistic effects: they imply that the emission is beamed 
in the direction of motion, transverse Doppler shifted, and gravitationally 
redshifted. As a result, the total line is broad and skewed in a characteristic 
manner. Such lines are seen in the X-ray spectra of most Seyfert 1 galaxies. 
In the Seyfert galaxy MCG-6-30-15 the fluorescent iron line was observed to 
be (red)shifted further to lower energies.^® 




Energy (keV) 

Fig. 7. The broad iron line from MCG-6-30-15. The best-fitting, maximally spin- 
ning Kerr black hole model is shown (from Iwasawa, K. et al. (1996), Mon. Not. 
Roy. Astron. Soc. 282, 1038). 



This suggests that the emission took place below 3i?g = 6M, i.e. below the 
innermost stable orbit for a Schwarzschild black hole. In 1996, the line shape 
was well fitted by the assumption that the line is produced in a close orbit 

In Seyfert 1 galaxies hard flares occur which irradiate the accretion disk, and 
produce a reflection component of continuum peaking at ~ 30 keV and the fluo- 
rescent iron line at about 6.5 keV. 
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around maximally rotating (extreme) Kerr black hole. In 1997, the param- 
eter a/M was quantified as exceeding 0.95. Hence, it has been “tentatively 
concluded that the line was the first spectroscopic evidence for a Kerr hole” 
[161]. 

There are alternative models for a broad skew iron line, including Comp- 
tonization by cold electrons, or the emission from irradiated matter falling 
from the inner edge of the disk around a nonrotating Schwarzschild black hole. 
It appears, however, that the data speak against these possibilities [161]. In 
any case, with future X-ray detectors, which will yield count rates orders 
of magnitude higher than ASCA, the line shapes should reveal in a much 
greater detail specific features of the Kerr metric. 

In addition, other possibilities to determine a/M exist. These include: 

(i) Observations of stars in relativistic orbits going through a disk around a 
supermassive rotating black hole [88,150]. 

(ii) Characteristic frequencies of the vibrational modes in disks or tori around 
rotating black holes [88,162]. 

(iii) The precession of a disk which is tilted with respect to the hole’s spin 
axis. This precession arises because of frame dragging and produces a periodic 
modulation of the X-ray luminosity. 

(iv) Astrophysically most important would be a discovery showing that the 
properties of cosmic jets depend on the value of a/M . This could indicate 
that the Blandford-Znajek mechanism (see Sect. 4.2) is really going on. Its 
likelihood would increase if jets were found with Lorentz factors 7 significantly 
exceeding 10 (see [88] for more details). 

(v) Last but not least, future observations of gravitational waves from black 
hole collisions [97] offer great hopes of a clean observation of a black hole 
geometry, without astrophysical complications. 

It is hard to point out any other exact solution of Einstein’s field equations 
(or of any kind of field equations?) discovered in the second half of the 20th 
century which has had so many impacts on so many diverse areas of physics, 
astrophysics, astronomy, and even space science as has had the Kerr metric. 



5 Black Hole Uniqueness 

and Multi-black Hole Solutions 

Since black holes can be formed from the collapse of various matter configu- 
rations, it is natural to expect that there will be many solutions of Einstein’s 
equations describing black holes. It is expected that the asymptotic final 
state of a collapse can be represented by a stationary spacetime, i.e. one 
which admits a 1-dimensional group of isometries whose orbits are timelike 
near infinity. Strong arguments show [26] that the event horizon of a station- 
ary black hole must be a Killing horizon. One of the most remarkable and 
surprising results of black hole theory are the sequence of theorems showing 
rigorously that the only stationary solution of the Einstein electrovacuum 
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equations that is asymptotically flat and has a regular event horizon is the 
Kerr-Newman solution. There is a number of papers on this issue ~ recent 
detailed reviews are given in [80,81]. The intuition gained from exact black 
hole solutions in proving the theorems has been essential. 

Roughly speaking, the uniqueness proof consists of the following three 
parts. First, one demonstrates the “rigidity theorem”, which claims that 
nondegenerate (k 7^ 0) stationary electrovacuum analytic black holes are 
either static or axially symmetric. One then establishes that the Reissner- 
Nordstrom nondegenerate electrovacuum black holes are all static (nonro- 
tating) nondegenerate black holes in electrovacuum. Finally, one separately 
proves that the nondegenerate Kerr-Newman black holes represent all non- 
degenerate axially symmetric stationary electrovacuum black holes. 

Although such results were proved more than 10 years ago, recently there 
has been new progress in the understanding of the global structure of station- 
ary black holes. Again, exact solutions have been inspiring: by gluing together 
two copies of the Kerr spacetime in a certain way, Chrusciel [80] constructed a 
black hole spacetime which is stationary but not axisymmetric, demonstrat- 
ing thus that the standard formulation and proof of the rigidity theorem [26] 
is not correct. (The reason being essentially that when one extends the isome- 
tries from a neighbourhood of the horizon by analytic continuation one has no 
guarantee that the maximal analytic extension is unique.) Chrusciel proved 
“a corrected version of the black hole rigidity theorem”; in the connected 
case one can prove a uniqueness theorem for static electrovacuum black holes 
with degenerate horizons. The uniqueness theorem for static degenerate black 
holes which demonstrates that the extreme Reissner-Nordstrom black hole is 
the only case, is of importance also in string theory. The most unsatisfactory 
feature of the rigidity theorem is the assumption of analyticity of the metric 
in a neighborhood of the event horizon. In this context, Chrusciel [80] men- 
tions the case of Robinson-Trautman exact analytic metrics, which can be 
smoothly but not analytically extended through an event horizon [163]. We 
shall discuss this issue in somewhat greater detail in Sect. 10. 

The black hole uniqueness theorems indicated above are concerned with 
only single black holes. (Corresponding spacetimes contain an asymptotically 
flat hypersurface S with compact interior and compact connected boundary 
dS which is located on the event horizon.) Consequently a question naturally 
arises as to whether one can generalize the theorems to some multi-black 
hole solutions. In classical physics a solution exists in which a system of n 
arbitrarily located charged mass points with charges Qi and masses m^, such 
that l^i] = '/Grrii, is in static equilibrium. In relativity the metric 



ds^ = —V “^dt^ + {dx^ + dy^ + dz^) , 
with time-independent V satisfying Laplace’s equation 
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is a solution of the Einstein-Maxwell equations with the electric field 



where V = {d/dx, d/dy, d/dz). (In standard units E = '/GW~^.) The sim- 
plest solution of this form is the Majumdar-Papapetrou metric, correspond- 
ing to a linear combination of n “monopole sources” with masses rrii > 0 and 
charges qi = rrii, located at arbitrary points Xi. 



Hartle and Hawking [164] have shown that every such spacetime can be an- 
alytically extended to a spacetime representing n degenerate charged black 
holes in static equilibrium. The points x = are actually event horizons of 
area 47 tto^. For the case of one black hole, the metric (30) is just the extreme 
Reissner-Nordstrom black hole in isotropic coordinates. 

A uniqueness theorem for the Majumdar-Papapetrou metrics is not avail- 
able, although some partial answers are known (see [165,166] for more de- 
tails). It is believed that these are the only asymptotically fiat, regular multi- 
black hole solutions of the Einstein-Maxwell equations. In fact, such a result 
would exclude an interesting possibility that a repulsive gravitational spin- 
spin interaction between two (or more) rotating, possibly charged, black holes 
can overcome their gravitational attraction and thus that there exists in Ein- 
stein’s theory of gravitation - in contrast to Newton’s theory ~ a stationary 
solution of the two-body problem. 

Among new solutions discovered by modern generating techniques, there 
are the solutions of Kramer and Neugebauer which represent a nonlinear su- 
perposition of Kerr black holes (see [167] for a review). These solutions have 
been the subject of a number of investigations which have shown that spin- 
spin repulsion is not strong enough to overcome attraction. In particular, 
two symmetrically arranged equal black holes cannot be in stationary equi- 
librium. The situation might change if one considers two Kerr-Newman black 
holes [168]. Here one has four forces to reckon with: gravitational and electro- 
magnetic Coulomb- type interactions, and gravitational and electromagnetic 
spin-spin interactions. One can then satisfy the conditions which render the 
system of two Kerr-Newman black hole free of singularities on the axis, and 
make the total mass of the system positive. However, there persists a singu- 
larity in the plane of symmetry away from the axis [168]. In view of this result 
we have conjectured that even with electromagnetic forces included one can- 
not achieve balance for two black holes, except for the exceptional case of two 
nonrotating extreme (degenerate) Reissner-Nordstrom black holes. Recently, 
some new rigorous results concerning the (non)existence of multi-black hole 
stationary, axisymmetric electrovacuum spacetimes have been obtained [169] 
(see also [80]), but the “decisive theorem” is still missing. 
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In connection with the problem of the balance of gravity by a gravitational 
spin-spin interaction, we should mention that there exists the solution of 
Dietz and Hoenselaers [170] in which balance of the two rotating “particles” 
is achieved. However, the “sources” are complicated naked singularities which 
become Curzon-Chazy “particles” (see Sect. 6.1) if the rotation goes to zero, 
and it is far from clear whether appropriate physical interior solutions can 
be constructed. 

In 1993, Kastor and Traschen [171] found an interesting family of solutions 
to the Einstein-Maxwell equations with a non-zero cosmological constant 
A. They describe an arbitrary number of charged black holes in a “back- 
ground” de Sitter universe. In the limit of H = 0 these solutions become 
Majumdar-Papapetrou static metrics. In contrast to these metrics, the cos- 
mological multi-black hole solutions with H > 0 are dynamical. Remarkably, 
one can construct solutions which describe coalescing black holes. In some 
cases cosmic censorship is violated ~ a naked singularity is formed as a result 
of the collision [172]. Although these solutions do not have smooth horizons, 
the singularities are mild, and geodesics can be extended through them. The 
metric is always at least C^. Since the solutions are dynamical, one may in- 
terpret the non-smoothness of the horizons as a consequence of gravitational 
and electromagnetic radiation. In this sense, the situation is analogous to the 
case of the Robinson-Trautman spacetimes discussed in Sect. 10. In five or 
more dimensions, however, one can construct static multi-black hole solutions 
with A = 0, which do not have smooth horizons [173]. The solutions of Kas- 
tor and Traschen also inspired a new and careful analysis [174] of the global 
structure of the Reissner-Nordstrom-de Sitter spacetimes characterized by 
mass, charge, and cosmological constant. The structure is considerably richer 
than that with A = 0. Most recently, the hoop conjecture (giving the crite- 
rion as to whether a black hole forms from a collapsing system) was discussed 
[175] by analyzing the solution of Kastor and Traschen. 

6 On Stationary Axisymmetric Fields 
and Relativistic Disks 

6.1 Static Weyl Metrics 

The static axisymmetric vacuum metrics in Weyl’s canonical coordinates p £ 
[0, oo), z,t G M,ip G [0, 27t) have the form 

ds^ = e~^^ [e^^ {dp^ -\- dz^) -\- p^dp^]^ — e^’^dt^. (34) 

The function U{p,z) satisfies flat-space Laplace’s equation 

d^u im d^U 

dp"^ p dp dz"^ 
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The function k{p, z) is determined from U by quadrature up to an addi- 
tive constant. The axis p = 0 is free of conical singularities at places where 
limp_>o k = 0. 

The mathematically simplest example is the Curzon-Chazy solution in 
which U = —mj^y is the Newtonian potential of a spherical point 
particle. The spacetime, however, is not spherically symmetric. In fact, one 
of the lessons which one has learned from this solution is the directional char- 
acter of the singularity at p^ z'^ = 0. For example, the limit of the invariant 
RaP'ysR°‘^'^^ depends on the direction of approach to the singularity. The 
singularity has a character of a ring through which some timelike geodesics 
may pass to a Minkowski region [176]. 

Various studies of the Weyl metrics indicated explicitly how important it 
is always to check whether a result is not just a consequence of the choice 
of coordinates. There is the subclass of Weyl metrics generated by the New- 
tonian potential of a constant density line mass (“rod”) with total mass M 
and (coordinate) length I, which is located along the z-axis with the middle 
point at the origin. These are Darmois-Zipoy-Vorhees metrics, called also the 
7 -metrics [64]. The Schwarzschild solution (a spherically symmetric metric!) 
is a special case in this subclass: it is given by the potential of the rod with 
I = 2M. Clearly, in general there is no correspondence between the geometry 
of the physical source and the geometry of the Newtonian “source” from the 
potential of which a Weyl metric is generated. 

A survey of the best known Weyl metrics, including some specific solutions 
describing fields due to circular disks is contained in [64]. More recently, 
Bicak, Lynden-Bell and Katz [177] have shown that most vacuum static Weyl 
solutions, including the Curzon and the Darmois-Vorhees-Zipoy solutions, 
can arise as the metrics of counterrotating relativistic disks (see [177] also for 
other references on relativistic disks). The simple idea which inspired their 
work is commonly used in Newtonian galactic dynamics [6]: imagine a point 
mass placed at a distance b below the centre p = 0 of a plane z = 0. This gives 
a solution of Laplace’s equation above the plane. Then consider the potential 
obtained by reflecting this z > 0 potential in z = 0 so that a symmetrical 
solution both above and below the plane is obtained. It is continuous but 
has a discontinuous normal derivative on z = 0, the jump in which gives 
a positive surface density on the plane. In galactic dynamics one considers 
general line distributions of mass along the negative z-axis and, employing 
the device described above, one finds the potential-density pairs for general 
axially symmetric disks. In [178], an infinite number of new static solutions 
of Einstein’s equations were found starting from realistic potentials used to 
describe flat galaxies, as given recently by Evans and de Zeeuw [179]. 

Although these disks are Newtonian at large distances, in their central 
regions interesting relativistic features arise, such as velocities close to the 
velocity of light, and large redshifts. In a more mathematical context, some 
particular cases are so far the only explicit examples of spacetimes with a 
“polyhomogeneous” null infinity (cf. [180] and Sect. 9), and spacetimes with 
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a meaningful, but infinite ADM mass [178]. New Weyl vacuum solutions 
generated by Newtonian potentials of fiat galaxies correspond to both finite 
and semi-infinite rods, with the line mass densities decreasing according to 
general power laws. It is an open question what kinds of singularities rods 
with different density profiles represent. 

Very recently, new interesting examples of the static solutions describing 
self-gravitating disks or rings, and disks or rings around static black holes 
have been constructed [181-183] and the effects of the fields on freely moving 
test particles studied [181]. Exact disks with electric currents and magnetic 
fields have also been considered [184]. 

Employing the Weyl formalism, one can describe nonrotating black holes 
strongly distorted by the surrounding matter. The influence of the matter can 
be so strong that it may even cause the horizon topology to be changed from 
spherical to toroidal (see [75] and references therein). 

Finally, we have to mention two solutions in the Weyl class, which were 
found soon after the birth of general relativity, and have not lost their in- 
fluence even today. The first, discovered by Bach and Weyl, is assigned by 
Bonnor [64] as “probably the most perspicacious of all exact solutions in 
GR”. It refers to two Curzon-Chazy “monopoles” on the axis of symmetry. 
One finds that the metric function k has the property that limp_>o fc 0, so 
that there is a stress described by a conical singularity between the particles, 
which holds particles apart. A similar solution can be constructed for the 
Schwarzschild “particles” (black holes) held apart by a stress. These cases 
can serve as one of the simplest demonstrations of the difference between 
the Einstein theory and field theories like the Maxwell theory: it is only in 
general relativity in which field equations involve also equations of motion. 

The second “old” solution which has played a very significant role is the 
metric discovered by Levi-Civita. It belongs to the class of degenerate (type 
D) static vacuum solutions which form a subclass of the Weyl solutions. In the 
invariant classification of the degenerate solutions by Ehlers and Kundt [53] , 
this solution is contained in the last, third subclass. That is why Ehlers and 
Kundt called it the C-metric, and it is so well known today. We shall discuss 
the G-metric later (Sect. 11) in greater detail since, as it has been learned in 
the 1970s, it is actually a radiative solution representing uniformly accelerated 
black holes. What Levi-Civita found and Ehlers and Kundt analyzed is only 
a portion of spacetime in which the boost Killing vector is timelike, and 
the coordinates can thus be found there (analogous to the coordinates in 
a uniformly accelerated frame in special relativity) in which the metric is 
time-independent . 

6.2 Relativistic Disks as Sources of the Kerr Metric 
and Other Stationary Spacetimes 

Thanks to the black hole uniqueness theorems (Sect. 5), the Kerr metric rep- 
resents the unique solution describing all rotating vacuum black holes. Nev- 
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ertheless, although the cosmic censorship conjecture, on which the physical 
relevance of the Kerr metric rests, is a very plausible hypothesis, it remains, 
as was noted in several places above, one of the central unresolved issues in 
relativity. It would thus support the significance of the Kerr metric if a phys- 
ical source were found which produces the Kerr field. The situation would 
then resemble the case of the spherically symmetric Schwarzschild metric 
which can represent both a black hole and the external field due to matter. 

This has been realized by many workers. The review on the “Sources for 
the Kerr Metric” [185] written in 1978, contains 71 references, and concludes 
with: “Destructive statements denying the existence of a material source for 
the Kerr metric should be rejected until (if ever) they are reasonably justi- 
fied.” The work from 1991 gives “a toroidal source”, consisting of “a toroidal 
shell ..., a disk ...and an annulus of matter interior to the torus” [186]. 
The masses of the disk and annulus are negative. To summarize in Hermann 
Bondi’s way, the sources suggested for the Kerr metric have not been the 
easiest materials to buy in the shops . . . 

The situation is somewhat different in the special case of the extreme 
Kerr metric, where there is a definite relationship between mass and angular 
momentum. The numerical study [187] of uniformly rotating disks indicated 
how the extreme Kerr geometry forms around disks in the “ultrarelativistic” 
limit. These numerical results have been supported by important analytical 
work (see Sect. 6.3). However, in the case of a general Kerr metric physical 
sources had not been found before 1993. 

A method similar to that of constructing disk sources of static Weyl space- 
times (described in Sect. 5.1) has been shown to work also for axisymmetric, 
reflection symmetric, and stationary spacetimes [188,189]. It is important to 
realize that although now no metric function solves Laplace’s equation as 
in the static case, we may view the procedure described in Sect. 5.1 as the 
identification of the surface z = b with the surface 2 = —b. The field then 
remains continuous, but the jump of its normal derivatives induces a matter 
distribution in the disk which arises due to the identification of the surfaces. 
What remains to be seen, is whether the material can be “bought in the 
shops” . This idea can be employed for all known asymptotically flat station- 
ary vacuum spacetimes, for example for the Tomimatsu-Sato solutions, for 
the “rotating” Curzon solution, or for other metrics (cf. [61] for references). 

Any stationary axisymmetric vacuum metric can be written in canonical 
coordinates {t,tp,p,z) in the form [61] 

(dp^ -I- dz^) + p^dp^)] — {dt + Ad<p)^ , (36) 

where U, k, and A are functions of p, z. For the Kerr solution (mass M, 
specific angular momentum a > 0), the functions U,k,A are ratios of poly- 
nomials when expressed in spheroidal coordinates [61]. 

Now, identify the “planes” z = b = constant > 0 and z = —b (this iden- 
tification leads to disks with zero radial pressure). With the Kerr geometry 



The Role of Exact Solutions 



59 



the matching is more complicated than in the static cases, and therefore, 
one has to turn to Israel’s covariant formalism (see [190] for its recent exposi- 
tion) . Using this formalism one is able to link the surface stress-energy tensor 
of the disk arising from this identification, to the jump of normal extrinsic 
curvature across the timelike hypersurface given hy z = b (with the jump be- 
ing determined by the discontinuities in the normal derivatives in functions 
U,k,A). 

The procedure leads to physically plausible disks made of two streams 
of collisionless particles, that circulate in opposite directions with differen- 
tial velocities [188,189]. Although extending to infinity, the disks have fi- 
nite mass and exhibit interesting relativistic properties such as high veloci- 
ties, large redshifts, and dragging effects, including ergoregions. Physical disk 
sources of Kerr spacetimes with a? > can be constructed (though these 
are “less relativistic”). And the procedure works also for electrovacuum sta- 
tionary spacetimes. The disks with electric current producing Kerr-Newman 
spacetimes are described in [191], where the conditions for the existence of 
(electro)geodesic streams are also discussed. 

The power and beauty of the Einstein field equations is again illustrated: 
the character of exact vacuum fields determines fully the physical character- 
istics of their sources. In a more sophisticated way, this is seen in the problem 
of relativistic rigidly (uniformly) rotating disks of dust. 

6.3 Uniformly Rotating Disks 

The structure of an infinitesimally thin, finite relativistic disk of dust par- 
ticles which rotate uniformly around a common centre was first explored 
by J. Bardeen and R. Wagoner 25 years ago [187]. By developing an effi- 
cient expansion technique in the quantity 5 = 2c/(l + -Zc), Zc denoting the 
central redshift, they obtained numerically a fairly complete picture of the 
behaviour of the disk, even in the ultrarelativistic regime (<5 — ^ 1). In their 
first letter from 1969 they noted that “there may be some hope of finding an 
analytic solution” . Today such a hope has been substantiated, thanks to the 
work of G. Neugebauer and R. Meinel (see [192,193] and references therein). 
The solution had, in fact, to wait until the “soliton-type-solution generating 
techniques” for nonlinear partial differential equations had been brought over 
from applied mathematics and other branches of physics to general relativity, 
starting from the end of the 1970s. 

These techniques have been mainly applied only in the vacuum cases so 
far, but this is precisely what is in this case needed: the structure of the 
thin disk enters the field equations only through the boundary conditions at 
z = 0, 0 < p < a (a is the radius of the disk). The specific procedures which 
enabled Neugebauer and Meinel to tackle the problem are sophisticated and 
lengthy. Nevertheless, we wish to mention them telegraphically at least, since 
they represent the first example of solving the boundary value problem for a 
rotating object in Einstein’s theory by analytic methods. 
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In the stationary axisymmetric case, Einstein’s vacuum field equations 
for the metric (18) imply the well-known Ernst equation (see e.g. [61]) - a 
nonlinear partial differential equation for a complex function / of p and z: 



(Re/) 

where the Ernst potential 



f,pp + f,z 



f{p, z) = e 



^/.P =fl + f% 


(37) 


+ ib, 


(38) 



with U{p,z) being the function entering the metric (36), function b(p,z) is a 
“potential” for A{p,z) in (37), 



A^p = pe A^^ = -pe ^^b^p, (39) 

and the last function k{p, z) in (37) can be determined from U and b by 
quadratures. 

The Ernst equation can be regarded as the integrability condition of a 
system of linear equations for a complex matrix which is a function of 
p + iz, p — iz, and of a (new) complex parameter A. Knowing <T, one can de- 
termine / from ^ at A = 1. Now the problem of solving the linear system can 
be reformulated as the so called Riemann-Hilbert problem in complex func- 
tion theory. (This, very roughly, means the following: let K be a closed curve 
in the complex plane and F{K) a matrix function given on K] find a matrix 
function which is analytic inside L, and analytic outside K such 
that 'Pin'Pout = F on K.) The Riemann-Hilbert problem can be formulated 
as an integral equation. The hardest problem with which Neugebauer and 
Meinel were faced was in connecting the specific physical boundary values of 
/ on the disk with the functions entering the Riemann-Hilbert problem (with 
contour K being determined by the position of the disk in the p, z plane), 
and with the corresponding integral equation. The fact that they succeeded 
and found the solution of their integral equation is a remarkable achievement 
in mathematical physics. The gravitational field and various physical char- 
acteristics of the disk (e.g. the surface density) are given up to quadratures 
in terms of ultraelliptic functions [192], which can be numerically evaluated 
without difficulties. This result, however, may appear as a “lucky case”: it 
does not imply that one will be able to tackle similarly more complicated sit- 
uations as, for example, thin disks with pressure, with non-uniform rotation, 
or 3-dimensional rotating bodies such as neutron stars. 

Many physical characteristics of uniformly rotating relativistic disks such 
as their surprisingly high binding energies, the high redshifts of photons emit- 
ted from the disks, or the dragging of inertial frames in the vicinity of the 
disks, were already obtained with remarkable accuracy in [187], as the ex- 
act solution now verifies. Here we only wish to demonstrate the fundamental 
difference between the Newtonian and relativistic case, as it is illustrated 
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GM^/cJ 

Fig. 8. The general relativistic ( “Einsteinian” ) thin disk of rigidly rotating dust 
constructed by Neugebauer and Meinel, compared with the analogous disk in New- 
tonian theory. If the angular momentum is too low, the disk forms a rotating (Kerr) 
black hole. (From [193].) 



in Fig. 8. The rigidly rotating disk of dust of Neugebauer and Meinel repre- 
sents the relativistic analogue of a classical Maclaurin disk. For the Maclaurin 
disk, it is easy to show that the (dimensionless) quantities y = 2GQM/c^ and 
X = GM^/cJ {M and J are the total mass and angular momentum respec- 
tively, and n is the angular velocity) are related hy y = (97r^/125)x^. For a 
fixed M the angular velocity G ^ y can be increased arbitrarily, with J being 
correspondingly decreased. For relativistic disks, however, there is an upper 
bound on 17 given by Slmax = c^/2GM, whereas J is restricted by the lower 
bound Jmin = GM'^ /c. With an angular momentum too low, a rigidly rotat- 
ing disk cannot exist. If we “prepare” such a disk, it immediately begins to 
collapse and forms - assuming the cosmic censorship - a rotating Kerr black 
hole with x = GM^/cJ > 1. (Notice that the assumption of rigid rotation 
is here crucial: the differentially rotating disks considered in the preceding 
section can have an arbitrary value of x.) Since one can define the angular 
velocity f2{M, J) of the horizon of a Kerr hole, one may consider y{x) for 
black hole states with a; > 1 (cf. Fig 8). The rigidly rotating disk states and 
the black hole states just “meet” at y = a; = 1. In the ultrarelativistic limit 
the gravitational field outside the disk starts to be unaffected by the detailed 
structure of the disk - it approaches the field of an extremely rotating Kerr 



62 



Jin Bicak 



black hole with x = 1. Such a result had already been obtained by Bardeen 
and Wagoner. However, it is only now, with the exact solution available, that 
it can be investigated with full rigor. It gives indirect evidence that Kerr 
black holes are really formed in the gravitational collapse of rotating bodies. 

As noticed also by Bardeen and Wagoner, in the ultrarelativistic limit 
the disk itself “becomes buried in the horizon of the extreme Kerr metric, 
surrounded by its own infinite, non asymptotically flat universe” (see [194] for 
a recent detailed analysis of the ultrarelativistic limit). Similar phenomena 
arise also in the case of some spherical solutions of Einstein-Yang-Mills-Higgs 
equations (cf. [195] and Sect. 13). 

7 Taub-NUT Space 

The name of this solution of vacuum Einstein’s equations fits both to the 
names of its discoverers (Taub-Newman-Unti-Tamburino) and to its curious 
properties. Owing to these properties (which induced Misner [196] to consider 
the solution “as a counterexample to almost anything”), this spacetime has 
played a significant role in exhibiting the type of effects that can arise in 
strong gravitational fields. 

Taub [197] discovered an empty universe with four global Killing vectors 
almost half a century ago, during his pioneering study of metrics with several 
symmetries. By continuing the Taub universe through its horizon one arrives 
in NUT space. NUT space, however, was only discovered in 1963 by a different 
method [198]. In fact, it could have been obtained earlier by applying the 
transformation given in Jurgen Ehlers’ dissertation [54] and his talk at the 
GR2 conference in Royaumont in 1959 [55]. This transformation gives the 
recipe for obtaining stationary solutions from static ones. How the NUT 
space can be obtained by applying this transformation to the Schwarzschild 
metric was demonstrated explicitly by Ehlers at GR4 in London in 1965 [56]. 

7.1 A New Way to the NUT Metric 

Here we shall briefly mention a simple, physically appealing new derivation 
of the NUT metric given recently by Lynden-Bell and Nouri-Zonoz (LBNZ) 
[199]. Their work also shows how even uncomplicated solutions may still be 
of interest in unexpected contexts. LBNZ’s inspiration to study the NUT 
space has in fact come from Newton’s Principia! In one of his scholia Newton 
discusses motion under the standard central force plus a force which is normal 
to the surface swept out by the radius vector to the body which is describing 
the non-coplanar path. A simple interesting case is the motion of mass ttiq 
satisfying the equation 



mo — = -V[r) r+ — V X Bg, 



(40) 



The Role of Exact Solutions 



63 



where v = rjr, 

Bg = -Q f/r^, Q = Q c/mo, (41) 

Q and Q are constants, and c is the velocity of light. Here we write c explicitly 
though c = 1, to make the analogy with magnetism. Indeed, Bg is the field of 
a “gravomagnetic” monopole of strength Q. The classical orbits of particles 
lie on cones which, if the monopole is absent, flatten into a plane [199]. 

It was known that NUT space corresponds to the mass with a gravomag- 
netic monopole, but this was never used in such a physical way as by LBNZ 
for its derivation. The main point is to start from the well-known split of the 
stationary metrics as described in Landau and Lifshitz [139] (see [200] for a 
covariant approach, and the contribution of Beig and Schmidt in the present 
volume) 

ds^ = —e~'^''{dt — Aidx'^)^ + jijdx^dx^ , (42) 

where are independent of t. This form is unique up to the choice 

of time zero: t = t + x(a:*) implies again the metric in the form (42) in 
{t ,x®), with the “vector potential” undergoing a gauge transformation A[ = 
+ ViX. Writing down the equation of motion of a test particle in metric 
(42), in analogy with the equation of motion of a charged particle in an 
electromagnetic field, one is naturally led to define the “gravoelectric” and 
“gravomagnetic” fields by 



Eg = \/v, Bg = \7xA, (43) 

where “Vx” is with respect to Following the problem of §95 in [139] 
one then rewrites all Einstein’s equations in terms of the fields (43), the 
metric and their derivatives. To find the vacuum spherically symmetric 
spatial y-metric one takes 'yijdx^dx^ = dr'^ + r'^ {dO^ + sir? 9 dip^), and one 
assumes ix = v{r), and Bg = —Qe~^jr'^. The Einstein equations then imply 
the spacetime metric, which is not spherically symmetric, in the form 

ds^ = ^dt — 2q {1 + cos 9) d(fi^ +{l — q^/r^) ^e^‘'dr^ 

+ + sin^ 9 dp'^) , (44) 

where q = Qj2 = constant, 

e~^'' = 1 — 2r“^ (^q^ + m\/r‘^ — ^ (45) 

and the vector potential Ag, = 2q{ \ + cos 9) satisfies (43). (The factor (1 — 
q^ jr"^) should be raised to the power —1 in equation (3.22) in [199], as it 
is clear from (3.20).) Equation (44) is the NUT metric, with r being the 
curvature coordinate of spheres r = constant. With g = 0 the metric (44) 
becomes the Schwarzschild metric in the standard Schwarzschild coordinates. 
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More commonly the metric (44) is written in the form 
/ 9 

ds^ = —Vidi+Aq sin^ 2 ^ ’ 



^= 1-2 

_l_ q-i 

which can be obtained from (44) by putting 



= \/r'^ - 






i = t — Aqip. 



(46) 

(47) 



(48) 



Recently, Ehlers [201] considered a Newtonian limit of NUT space within 
his frame theory which encompasses general relativity and Newton-Cartan 
theory (a slight generalization of Newton’s theory). The main purpose of 
Ehlers’ frame theory is to define rigorously what is meant by the statement 
that a one parameter family of relativistic spacetime models converges to a 
Newton-Cartan model or, in particular, to a strictly Newtonian model. 

The strictly Newtonian limit occurs when the Coriolis angular velocity 
field u), related to the connection coefficients in the Newton-Cartan the- 
ory, depends on time only. NUT spacetimes approach a truly Newton-Cartan 
limit with spatially non-constant radial Coriolis field to’’ = —q/f'^, which in 
this limit coincides with the Newtonian gravomagnetic field. As in the anal- 
ogous classical problem with the equation of motion (40), the geodesics in 
NUT space lie on cones. This result has been used to study gravitational leas- 
ing by gravomagnetic monopoles [199]: they twist the rays that pass them in 
a characteristic manner, different from that due to rotating objects. 

The metrics (44) and (46) appear to have a preferred axis of fixed points 
of symmetry. This is a false impression since we can switch the axis into any 
direction by a gauge transformation. For example, the metric (44) has a coni- 
cal singularity at 0 = 0 but is regular at 0 = tt, whereas the metric (46) has a 
conical singularity at 0 = tt but is regular at 0 = 0. The metrics are connected 
by the simple gauge transformation, i.e. t ^ t = t — Aqip. A mass endowed 
with a gravomagnetic monopole appears as a spherically symmteric object 
but the spacetime is not spherically symmetric according to the definition 
given in Sect. 2.1. Nevertheless, there exist equivalent coordinate systems in 
which the axis can be made to point in any direction - just as the axis of the 
vector potential of a magnetic monopole can be chosen arbitrarily. For further 
references on interpreting the NUT metric as a gravomagnetic monopole, see 
[199] and the review by Bonnor [64]. 



7.2 Taub-NUT Pathologies and Applications 

By introducing two coordinate patches, namely the coordinates of metric (44) 
to cover the south pole {6 = tt) and those of (46) the north pole (0 = 0), the 
rotation axis can be made regular. However since (p is identified with period 
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27t, equation (48) implies that t and t have to be identified with the period 
8nq. Then observers with (f, 9, ip) = constant follow closed timelike lines if V 
in (47) is positive, i.e. if r > ho = m + (m^ + q^) 3 . The hypersurface r = fo is 
the null hypersurface - horizon, below which lines (t, 6, ip) = constant become 
spacelike. Because of the periodic identification of t and t, the hypersurfaces 
of constant r change the topology from 5^ x to S^, on which t/2q, 9, p 
become Euler angle coordinates. 

The region with E < 0 is the Taub universe: it has homogeneous but 
non-isotropic space sections r = constant. The coordinate f, allowed to run 
from — oo to -|-oo, is a timelike coordinate, and is naturally denoted by t in 
the Taub region. 

In addition to the closed timelike lines in the NUT region there are fur- 
ther intriguing pathologies exhibited by the Taub-NUT solutions. Here we 
just list some of them and refer to the relevant literature [26,27,196]. The 
Taub region is globally hyperbolic: its entire future and past history can be 
determined from conditions given on a spacelike Cauchy hypersurface. How- 
ever, this is not the case with the whole Taub-NUT spacetime. As in the 
Reissner-Nordstrom spacetimes (Sect. 3), there are Cauchy horizons H^{S) 
of a particular spacelike section S of maximal proper volume lying between 
the globally hyperbolic Taub regions and the causality violating NUT re- 
gions. H^{S) are smooth, compact null hypersurfaces diffeomorphic to 
- the generators of such null surfaces are closed null geodesics. The Taub 
region is limited between < t < t+, where t± are roots of V in equa- 
tion (47) (with the interchange t f). This region is compact but there 
are timelike and null geodesics which remain within it and are not complete. 
(See [27] for a nice picture of these geodesics spiralling around and approach- 
ing 77+ (A) asymptotically.) This pathological behaviour of “the incomplete 
geodesics imprisoned in a compact neighbourhood of the horizon” was inspi- 
rational in the definition of singularities [77] - one meets here the example in 
which the geodesic incompleteness is not necessarily connected with strong 
gravitational fields. It can be shown, however, that after an addition of even 
the slightest amount of matter this pathological behaviour will not take place 
“ true singularities arise. 

This enables one to consider the time between and as the lifetime 
of the Taub universe. Wheeler [202] constructed a specific case of the Taub 
universe which will live as long as a typical Friedmann closed dust model 
(^ 10^*^ years) but will have a volume at maximum expansion smaller by 
a factor of 5 x 10^°. This example thus appears to be a difficulty for the 
anthropic principle. 

Taub space seems also to be the only known example giving the possibility 
of making inequivalent NUT-like extensions which lead to a non-Hausdorff 
spacetime manifold [26,200,203]. 

The Taub-NUT solution plays an important role in cosmology and quan- 
tum gravity. Here we wish to note yet two other recent applications of this 
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space. About ten years ago, interest was revived in closed timelike lines, time 
machines, and wormholes. One of the leaders in this activity. Kip Thorne, 
explains in [204] in a pedagogical way the main recent results on closed time- 
like curves and wormholes by using “Misner space” ~ Minkowski spacetime 
with identification under a boost, which Misner introduced as a simplified 
version of Taub-NUT space. 

The second application of the Taub-NUT space is still more remarkable - 
it plays an important role outside general relativity. The asymptotic motion of 
monopoles in (super-)Yang-Mills theories corresponds to the geodesic motion 
in Euclidean Taub-NUT space [205]. Euclidean Taub-NUT spaces have been 
discussed in many further works on monopoles in gauge theories. One of 
the latest of these works [206], on the exact T-duality (which relates string 
theories compactified on large and small tori) between Taub-NUT spaces and 
so called “calorons” (instantons at finite temperature defined on x 5^), 
gives also references to previous contributions. 

8 Plane Waves and Their Collisions 

8.1 Plane-Fronted Waves 

The history of gravitational plane waves had began already by 1923 with 
the paper on spaces conformal to fiat space by Brinkmann. Interest in these 
waves was revived in 1937 by Rosen, and in the late 1950s by Bondi, Pirani 
and Robinson, Holy, and Peres (see [53,61] for references). A comprehensive 
geometrical approach to these spacetimes soon followed in the classical trea- 
tise by Jordan, Ehlers and Kundt [57], and in the subsequent well-known 
chapter by Ehlers and Kundt [53] . As an application of various newly devel- 
oped methods to analyze gravitational radiation, and as a simple background 
to test various physical theories, plane waves have proved to be a useful and 
stimulating arena which offers interesting contests even today, as we shall 
indicate by a few examples in Sect. 8.2. 

Consider a congruence of null geodesics (rays) x°"{v) such that dx°"/dv = 
k°‘, fco,/c“ = 0, = 0, V being an affine parameter. In general a geodesic 

congruence is characterized by its expansion 9, shear |cr| and twist oj given 
by (see e.g. [19]) 

0 = , |a| = (49) 

^ (50) 

According to the definition given by Ehlers and Kundt [53] a vacuum 
spacetime is a “plane-fronted gravitational wave” if it contains a shearfree 
|cr| = 0 geodesic null congruence, and if it admits “plane wave surfaces” 
(spacelike 2-surfaces orthogonal to /c“). This definition is inspired by plane 
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electromagnetic waves in Maxwell’s theory. Electromagnetic plane waves are 
null fields (“pure radiation fields”): there exists a null vector tangent to 
the rays, which is transverse to the electromagnetic field Fap, i.e. = 0, 

= 0, and the quadratic invariants of which vanish, FapF°‘^ = 0 = 
FapF*^l^, where F*^ is dual to F^is. Analogously, Petrov type N gravita- 
tional fields (see [61]) are null fields with rays tangent to k°‘ (the “quadru- 
ple Debever-Penrose null vector”), and with the Riemann tensor satisfying 
Rap'isk^ = 0, Rai 3 'ysR°‘^'^^ = 0, and Ra^^yS — Then the Bianchi 

identities and the Kundt-Thompson theorem for type N solutions in vacuum 
spacetimes (also more generally, under the presence of a nonvanishing cos- 
mological constant) imply that the shear of must necessarily vanish (see 
[61,207]). Because of the existence of plane wave surfaces, the expansion (49) 
and twist (50) must vanish as well, 0 = tu = 0. In this way we arrive at 
the Kundt class of nonexpanding, shearfree and twistfree gravitational waves 
[61]. The best known subclass of these waves are “plane-fronted gravitational 
waves with parallel rays” (pp-waves) which are defined by the condition that 
the null vector k°‘ is covariantly constant, ka-^i^ = 0. Thus, automatically 
is the Killing vector, and 0 = |cr| = a; = 0. 

Ehlers and Kundt [53] give several equivalent characterizations of the 
pp-waves and show, following their previous work [57], that in suitable null 
coordinates with a null coordinate u such that k^ = £md = {d/dv)°‘, 
the metric has the form 

ds^ = 2d(^dQ — 2dudv — 2i7(u, C, ^)dv? , (51) 

where F[ is a, real function dependent on u, and on the complex coordinate C 
which spans the wave 2-surfaces u = constant, v = constant. These 2-surfaces 
with Euclidean geometry are thus contained in the wave hypersurfaces u = 
constant and cut the rays given by (u, Q = constant, v changing. The vacuum 
field equations imply 2-dimensional Laplace’s equation 

H,cc = 0, (52) 

so that we can write 

2i7 = /(u,C) + /(u,C), (53) 

where f{u, ^) is an arbitrary function of u, analytic in (). To characterize the 
curvature in the waves and their effect on test particles it is convenient to 
introduce the null complex tetrad, such that at each spacetime point, together 
with the preferred null vector A:“, we have a null vector F,l°‘ka = — 1, and 
complex spacelike vector m“ satisfying mafh°^ = l,mcck°‘ = ma_l°‘ = 0. For 

This algebraic (local) analogy between null fields exists also between electromag- 
netic and gravitational shocks (possible discontinuities across null hypersurfaces), 
and in the asymptotic behaviour of fields at large distances from sources (the 
“peeling property” - see e.g. [19,27]). 
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the metric (51) the only nonvanishing projection of the Weyl (in the vacuum 
case, the Riemann) tensor onto this tetrad is the (Newman-Penrose) scalar 

tf'4 = Caf3-y5l°‘rh^Pm^ = H , (54) 

which denotes a transverse component of the wave propagating in the 
direction. As shown by Ehlers and Kundt [53] (see also e.g. [61]), though in 
a somewhat different notation, we can use again an analogy with the elec- 
tromagnetic field - described for an analogous plane wave by the transverse 
component (j )2 = FapfhPP - and write = Ae*®, where real A > 0 is con- 
sidered as the amplitude of the wave, and at each spacetime point associate 
0 with the plane of polarization. Vacuum pp-waves with 0 = constant are 
called linearly polarized. 

Consider a free test particle (observer) with 4-velocity u and a neighbour- 
ing free test particle displaced by a “connecting” vector Introducing 

then the physical frame ep) which is connected with the observer such that 
6(0) = u and 6(j) are connected with the null tetrad vectors by 

m ^ (e(i) - *e(2)) , 

1= ^("-®(3))> k= ^(u + 6(3)), 

we find that the equation of geodesic deviation in spacetime with only tfq 0 
implies (see [207]) 

ZP') = -A+Z^F + A,,ZP\ ZP'> = A+ZP'> + A^Z^F^ ^(3) = 0, (56) 

where A+ = | Re tfti, Ax = \ Im are amplitudes of “-I-” and “x” polar- 
ization modes, and are the frame components of the connecting vector 
Z. Since the frame vector 6(3) is chosen in the longitudinal direction (the di- 
rection of the rays), equation (56) clearly exhibits the transverse character of 
the wave. If particles, initially at rest, lie in the (e(]^),e(2)) plane, there is no 
motion in the longitudinal direction of 6(3). The ring of particles is deformed 
into an ellipse, the axes of different polarizations are shifted one with respect 
to the other by J (such behaviour is typical for linearized gravitational waves 

- cf. e.g. [18]). Making a rotation in the transverse plane by an angle d, 

e'(i) = cos'de(i) -I- sini?e(2) , e'(2) =— sin i? 6(i) -|- cost? 6(2) , (57) 

and taking d = 'd+(r) = — | Arg tf'4 = —\0, then A'^_ = A(^ = 0 

- the wave is purely “-I-” polarized. If 0 = constant, the rotation angle is 
independent of time - the wave is rightly considered as linearly polarized. 

Hence, with the discovery of pp-waves, the understanding of the proper- 
ties of gravitational radiation has become deeper and closer to physics. In 
addition, the pp-waves can easily be “linearized” by taking the function H 
in the metric (51) to be so small that the spacetime can be considered as a 
perturbation of Minkowski space within the linearized theory. Such an “easy 
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way” from the linear to fully nonlinear spacetimes is of course paid by their 
simplicity. 

In general, in fact, the pp- waves have only the single isometry generated by 
the Killing vector k°‘ = (9/9n)“. However, a much larger group of symmetries 
may exist for various particular choices of the function Jordan, 

Ehlers and Kundt [57] (see also [53,61]) gave a complete classification of the 
pp-waves in terms of their symmetries and corresponding special forms of H . 
For example, in the best known case of plane waves to which we shall turn 
in greater detail below, is independent of so that after removing linear 
terms in C by a coordinate transformation, we have 

H{uXX) = A{u)C^ + A{u)C\ (58) 

with A{u) being an arbitrary function of u. This spacetime admits five Killing 
vectors. 

Recently, Aichelburg and Balasin [208,209] generalized the classification 
given in [57] by admitting distribution-valued profile functions and allowing 
for non- vacuum spacetimes with metric (51), but with H which in general 
does not satisfy (52). They have shown that with H in the form of delta-like 
pulses, 

H{uX,0 = nC0S{u), (59) 

new symmetry classes arise even in the vacuum case. 

The main motivation to consider impulsive pp-waves stems from the met- 
rics describing a black hole or a “particle” boosted to the speed of light. The 
simplest metric of this type, given by Aichelburg and Sexl [210], is a Schwarz- 
schild black hole with mass m boosted in such a way that fi = mj \/\ — uA 
is held constant as w — >■ 1. It reads 

= 2dfd( — 2dudv — ‘ip,log{f()S{u)du^ , (60) 

with H clearly in the form (59). This is not a vacuum metric: the energy- 
momentum tensor Tq,^ = p,5(u)5{C,)kakp indicates that there is a “point-like 
particle” moving with the speed of light along u = 0. The Aichelburg-Sexl 
metric and its more recent generalizations have found interesting applications 
even outside of general relativity. Some of them will be briefly mentioned in 
Sect. 8.2. 

Let us now turn to the simplest class of pp-waves, which comprises of the 
best known and illuminating examples of exact gravitational waves. These are 
the plane waves. They are defined as homogeneous pp-waves in the sense that 
the curvature component lAi (see (54)) is constant along the wave surfaces so 
that function H is in the form (58). One can write H as in (53) where 

/(z.,C)= ^A(u)e*®(“)C^ ( 61 ) 

with linear terms being removed by a coordinate transformation. Just as a 
plane electromagnetic wave, a plane gravitational wave is thus completely 
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represented by its amplitude A{u) and polarization angle 0{u) as functions 
of the phase u. 

The plane waves, including their generalization into the Einstein-Maxwell 
theory (an additional term then appears in H, both 'F 4 and the elec- 

tromagnetic quantity <l >2 being independent of C)j were already studied in 
1926 (see [61]). A real understanding however came only in the late 1950s. 
Ehlers and Kundt [53] give various characterizations of this class. For ex- 
ample, they prove that a non- flat vacuum field is a pp-wave if and only if 
the curvature tensor is complex recurrent, i.e. if Pa/ 3 'yS,n = Pa/ 3 'ysq^i, where 
PajS-fS = Ra/ 3 'yS + * Rap-yS] and it is a plane wave if and only if the re- 
currence vector is collinear with a real null vector. They also state a 
nice theorem showing that the plane wave spacetimes defined by the met- 
ric (51), H and / given by (53), (69), C, = x + iy, and with coordinate 
ranges —00 < x,y,u,v < 00 , are geodesically complete if functions A{u) and 
0{u) are C^-functions. Quoting directly from [53], “there exist ... complete 
solutions free of sources (singularities), proving to think of a graviton field 
independent of any matter by which it be generated. This corresponds to the 
existence of source-free photon fields in electrodynamics”. Ehlers and Kundt 
[53] also state an open problem which, as far as I am aware, has not yet 
been solved: to prove that plane waves are the only geodesically complete 
pp- waves. 

The most telling examples of plane waves are sandwich waves. The ampli- 
tude A{u) in (61) need not be smooth: either it can only be continuous and 
nonvanishing on a finite interval of u (sandwich), or a step function (shock), 
or a delta function (impulse). A physical interpretation of such waves is bet- 
ter achieved in other coordinate systems, in which the metric “before” and 
“after” the wave is not Minkowskian but has a higher degree of smooth- 
ness. For linearly polarized waves {0 equal to zero), a convenient coordinate 
system can be introduced by setting (see e.g. [211]) ( = {l/fr 2 ){px iqy), 
V = (1/2) (t -I- 2 : -I- pp'x"^ qq'y"^), u = t — z, where ' = d/du, and functions 
p = p{u) and q = q{u) solve equations p" A{u)p = 0 and q" — A{u)q = 0. 
In these coordinates the metric turns out to be 

= —df^ -\- p^dx^ -\- q^dy^ -\- dz^. (62) 

In double-null coordinates u, v, with u = u = t— z,v = t-\-z, and with a 
general polarization, the metric can be cast into the form (see e.g. [65,212]) 

= — dudD -I- e“^(e'^ cosh lEda;^-|-e“'^ coshkEdy^ — 2sinh kkdxd?/), (63) 

where U, V, W depend on u only. This so called Rosen form was used in the 
classical paper on exact plane waves by Bondi, Pirani and Robinson [213]. 

A simple, textbook example [214] of a sandwich wave is the wave with a 
“square profile”: A{u) = 0 for m < 0 and u > a"^,A{u) = = constant for 

0 < u < a^. The functions p and q which enter (62) are then p = q = 1 at 
u < 0,p = cos(u/a),g = cosh(u/a) at 0 < m < a^, and p = — (u/a) sin a -I- 
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constant, (j = (u/a) sinha + constant at < u. This example can be used 
to demonstrate explicitly various typical features of plane sandwich gravita- 
tional waves within the exact theory: (i) the wave fronts travel with the speed 
of light; (ii) the discontinuities of the second derivatives of the metric tensor 
are permitted along a null hypersurface, but must have a special structure; 
(iii) the waves have a transverse character and produce relative accelera- 
tions in test particles; (iv) the waves focus astigmatically initially parallel 
null congruences (rays) that are pointing in other directions than the waves 
themselves; (v) as a consequence of the focusing, Rosen-type line elements 
contain coordinate singularities on a hypersurface behind the waves, and in 
general caustics will develop there [214]. 

The focusing effects imply a remarkable property of plane wave space- 
times: no spacelike global hypersurface exists on which initial data can be 
specified, i.e. plane wave spacetimes contain no global Cauchy hypersurface. 
This can be understood from Fig. 9. Considering a point Q in flat space in 
front of the wave, Penrose [215] has shown that its future null cone is dis- 
torted as it passes through the wave in such a manner that it is refocused to 
either a point i? or a line passing through R parallel to the wave front. Any 
possible Cauchy hypersurface going through Q must lie below the future null 
cone through Q, i.e. below the past null cone of R. Hence, it cannot extend 
as a spacelike hypersurface to spatial infinity. 




Fig. 9. The future null cone of the event Q is distorted as it passes through the 
plane wave, and refocused at the event R in such a manner that no Cauchy initial 
hypersurface going through Q exists. (From [215].) 



8.2 Plane-Fronted Waves: New Developments and Applications 

The interest in impulsive waves generated by boosting a “particle” at rest to 
the velocity of light by means of an appropriate limiting procedure persists 
up to the present. The ultrarelativistic limits of Kerr and Kerr-Newman 
black holes were obtained in [216-218], and recently, boosted static multipole 
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(Weyl) particles were studied [219]. Impulsive gravitational waves were also 
generated by boosting the Schwarzschild-de Sitter and Schwarzschild~anti de 
Sitter metrics to the ultrarelativistic limit [220,221]. 

These types of spacetimes, especially the simple Aichelburg-Sexl metrics, 
have been employed in current problems of the generation of gravitational 
radiation from axisymmetric black hole collisions and black hole encounters. 
The recent monograph by d’Eath [222] gives a comprehensive survey, includ- 
ing the author’s new results. There is good reason to believe that spacetime 
metrics produced in high speed collisions will be simpler than those corre- 
sponding to (more realistic) situations in which black holes start to collide 
with low relative velocities. The spacetimes corresponding to the collisions at 
exactly the speed of light is an interesting limit which can be treated most 
easily. Aichelburg-Sexl metrics are used to describe limiting “incoming states” 
of two black holes, moving one against the other with the speed of light. An 
approximation method has been developed in which a large Lorentz boost is 
applied so that one has a weak shock propagating on a strong shock. One 
finds an estimate of 16.8 % for the efficiency of gravitational wave generation 
in a head-on speed-of- light collision [222] . 

Great interest has been stimulated by ’t Hooft’s [223] work on the quan- 
tum scattering of two pointlike particles at centre-of-mass energies higher or 
equal to the Planck energy. This quantum process has been shown to have 
close connection with classical black hole collisions at the speed of light (see 
[222,224] and references therein). 

Recently, the Colombeau algebra of generalized functions, which enables 
one to deal with singular products of distributions, has been brought to gen- 
eral relativity and used in the description of impulsive pp-waves in various 
coordinate systems [225] , and also for a rigorous solution of the geodesic and 
geodesic deviation equations for impulsive waves [226]. The investigation of 
the equations of geodesics in non-homogeneous pp-waves (with / ^ C^) has 
shown that the motion of test particles in these spacetimes is described by 
the Henon-Heiles Hamiltonian which implies that the motion is chaotic [227]. 

Plane-fronted waves have been used as simple metrics in various other 
contexts, for example, in quantum field theory on a given background (see 
[228] for recent work), and in string theory [229]. As emphasized very recently 
by Gibbons [230], since for pp-waves and type A Kundt’s class (see the be- 
ginning of Sect. 8.1) all possible invariants formed from the Weyl tensor and 
its covariant derivatives vanish [231], these metrics suffer no quantum correc- 
tions to all loop orders. Thus they may offer insights into the behaviour of a 
full quantum theory. The invariants vanish also in type III spacetimes with 
nonexpanding and nontwisting rays [232]. 

8.3 Colliding Plane Waves 

As with a number of other issues in gravitational (radiation) theory, the 
pioneering ideas on colliding plane gravitational waves are connected with 
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Roger Penrose. It does not seem to be generally recognized that the basic 
idea appeared six years before the well-known paper by Khan and Penrose 
[233] in which the metric describing the general spacetime representing a 
collision of two parallel-polarized impulsive gravitational waves was obtained. 
Having demonstrated the surprising fact that general relativistic plane wave 
spacetimes admit no Cauchy hypersurface due to the focusing effect the waves 
exert on null cones, Penrose [215] (in footnote 12) remarks: “This fact has 
relevance to the question of two colliding weak plane sandwich waves. Each 
wave warps the other until singularities in the wave fronts ultimately appear. 
This, in fact, causes the spacetime to acquire genuine physical singularities in 
this case. The warping also produces a scattering of each wave after collision 
so that they cease to be sandwich waves when they separate (and they are 
no longer plane - although they have a two-parameter symmetry group).” 
The first detailed study of colliding plane waves, independently of Khan 
and Penrose, was also undertaken by Szekeres (see [234,235]). He formulated 
the problem as a characteristic initial value problem for a system of hyper- 
bolic equations in two variables (null coordinates) u, v with data specified on 
the pair of null hypersurfaces u = 0,v = 0 intersecting in a spacelike 2-surface 
(Fig. 10). In the particular case of spacetimes representing plane waves prop- 
agating before the collision in a flat background, Szekeres has shown that 
coordinates (of the “Rosen type” , as known from the case of one wave - see 
Eq. (63)) exist in which the metric reads 

ds^ = — e~^ du dv + 

+ e~^ [ cosh Wdx^ + e~^ cosh Wdy^ — 2 sinh Wdx dy] , (64) 

where M, U , V and W are functions of u and v. Coordinates x and y are 
aligned along the two commuting Killing vectors d/dx and d/dy, which are 
assumed to exist in the whole spacetime representing the colliding waves (cf. 
the note by Penrose above). In almost all recent work on colliding waves, 
region IV in Fig. 10, where m < 0, w < 0, is assumed to be flat. The null lines 
u = 0,v < 0 and ?; = 0, u < 0 are wavefronts, and in regions II {u < 0,v > 0) 
and III (m > 0, ?; < 0) one has the standard plane wave metric corresponding 
to two approaching plane waves from opposite directions. In region //, func- 
tions M, [/, y, W depend on v only, and in region III only on u. The waves 
collide at the 2-surface it = w = 0, in region I they interact. The spacetime 
here can be determined by the initial data posed on the i; > 0 portion of the 
hypersurface it = 0 (which in Fig. 10 are “supplied” by the wave propagating 
to the right) and by the data on the it > 0 portion of the hypersurface v = 0 
(given by the wave propagating to the left) . Unfortunately, the integration of 
such an initial value problem does not seem to be possible for general incom- 
ing wave forms and polarizations. If, however, the approaching waves have 
constant and aligned (parallel) polarizations, one may set the function W = 0 
globally. The solution of the initial value problem then reduces to a one di- 
mensional integral for the function V, and two quadratures for the function 
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M. (The function exp(— C/) must have the form f{u) + g{v) as a consequence 
of the field equations everywhere, and it can be determined easily from the 
initial data.) Despite these simplifications it is very difficult to obtain exact 
solutions in closed analytic form. Szekeres [235] found a solution (as he puts it 
“more or less by trial and error”) which, as special cases, includes the solution 
given by himself earlier [234] and the solution obtained independently and 
simultaneously by Khan and Penrose [233]. Although Szekeres’ formulation 
of a general solution for the problem of colliding parallel-polarized waves is 
difficult to use for constructing other specific explicit examples, it has been 
employed in a general analysis of the structure of the singularities produced 
by the collision [236], which will be discussed in the following. 




Fig. 10. The spacetime diagram indicating the collision of two plane-fronted gravi- 
tational waves which come from regions II and III, collide in region 7, and produce 
a spacelike singularity. Region IV is flat. 



It has also inspired an important, difficult piece of mathematical physics 
which was developed at the beginning of the 1990s in the series of papers by 
Hauser and Ernst [237] . Their new method of analyzing the initial value prob- 
lem can be used also for the case when the polarization of the approaching 
waves is not aligned. They formulated the initial value problem in terms of 
the equivalent matrix Riemann-Hilbert problem in the complex plane. Their 
techniques are related to those used by Neugebauer and Meinel to analyze and 
construct the rotating disk solution as a boundary value problem (Sect. 6.3). 
No analogous solution for colliding waves in the noncollinear case is available 
at present, but investigations in this direction are still in progress. Most re- 
cently, Hauser and Ernst prepared an extensive treatise [238] in which they 
give a general description and detailed mathematical proofs of their study of 
the solutions of the hyperbolic Ernst equation. 

The approach of Khan and Penrose for obtaining exact solutions describ- 
ing colliding plane waves starts in the region I where the waves interact: (i) 
find a solution with two commuting spacelike Killing vectors d/dx and d/dy, 
transform to null coordinates, and look back in time whether this solution 
can be extended across the null hypersurface u = 0, u = 0 so that it describes 
a plane wave propagating in the w-direction in region II and another plane 
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wave propagating in the w-direction in region III ; (ii) satisfy boundary con- 
ditions not only across boundaries between regions I and II, and regions I 
and III, but also across the boundaries between II and IV, and III and IV in 
such a manner that IV is flat. The original prescription of Khan and Penrose 
for extending the solution from region I to regions II and III consists in the 
substitutions uH(u) and vH(v) in place of u and v everywhere in the metric 
coefficients; here H{u) = 1 for it > 0, 77 = 0 for u < 0 is the usual Heaviside 
function. We then get the metric as a function of v (respectively it) in region 
II (respectively III) corresponding to the wave propagating to the right (re- 
spectively to the left) in Fig. 10. Finally, it remains to investigate carefully 
the structure of discontinuities and possible singularities on the null bound- 
aries between these regions. In the original Khan and Penrose solutions the 
Riemann tensor has a 5-function character on the boundaries between II and 
IV, and III and IV] but inside regions II and III themselves the spacetime 
is flat (the collision of impulsive plane waves). In the solution obtained by 
Szekeres [235], regions II and III are not flat, and the Riemann tensor at the 
boundaries between II (respectively III ) and IV is just discontinuous (the 
collision of shock waves) . 

Nutku and Halil [239] constructed an exact solution describing the colli- 
sion of two impulsive plane waves with non-aligned polarizations. In the limit 
of collinear polarizations their solution reduces to the solution of Khan and 
Penrose. All of these solutions reveal that the spacelike singularity always 
develops in region I (given by -|- = I in Fig. 10) - in agreement with the 
original suggestion of Penrose. Moreover, the singularity “propagates back- 
ward” and so called fold singularities, analyzed in detail in 1984 by Matzner 
and Tipler [240], appear also at v = 1 and u = 1 in regions II and III. 
This new type of singularity provides evidence of how even relatively recent 
studies of explicit exact solutions may reveal unexpected global features of 
relativistic spacetimes. 

The remarkable growth of interest in colliding plane waves owes much 
to the systematic (and symptomatic) effort of S. Chandrasekhar who, since 
1984, together with V. Ferrari, and with B. Xanthopoulos, published a num- 
ber of papers on colliding plane vacuum gravitational waves [241,242], and 
on gravitational waves coupled with electromagnetic waves, with null dust, 
and with perfect fluid (see [212] for references). The basic strategy of their 
approach follows that of Khan and Penrose: first a careful analysis of the pos- 
sible solution is done in the interaction region I, and then one works backward 
in time, extending the solutions to regions II, III and IV. 

The main new input consists in carrying over the techniques known from 
stationary, axisymmetric spacetimes with one timelike and one spacelike 
Killing vector to the case of two spacelike Killing vectors, djdx,dldy, and 
exploring new features. 

Taking a simple linear solution of the Ernst equation, E = Prj -\- iQy,, 
where P and Q are real constants which satisfy P^ + = 1, and rj, fi are 

suitable time and space coordinates, Chandrasekhar and Ferrari [241] show 
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that one arrives at the Nutku-Halil solution. In particular, if Q = 0, the Khan- 
Penrose solution emerges. Since by starting from the same simplest form of 
the Ernst function in the axisymmetric stationary case one arrives at the Kerr 
solution (or at the Schwarzschild solution for the real Ernst function), we may 
conclude that in region I the solutions of Khan and Penrose and of Nutku 
and Halil are, for spacetimes with two spacelike Killing vectors, the analogues 
of the Schwarzschild and Kerr solutions. This mathematical analogy can be 
generalized to colliding electromagnetic and gravitational waves within the 
Einstein-Maxwell theory - Chandrasekhar and Xanthopoulos [243] found the 
analogue of the charged Kerr-Newman solution. Such a generalization is also 
of interest from a conceptual viewpoint: the (5-function singularity in the Weyl 
tensor of an impulsive gravitational wave might imply a similar singularity in 
the Maxwell stress tensor, which would seem to suggest that the field itself 
would contain “square roots of the (5-function” . 

In the most important paper [242] of the series, Chandrasekhar and Xan- 
thopoulos, starting from the simplest linear solution for the Ernst conjugate 
function £1+ = Prj + iQ^,P^ + Q'^ = 1, obtained a new exact solution for 
colliding plane impulsive gravitational waves accompanied by shock waves. 
This solution results in the development of a nonsingular Killing-Cauchy 
horizon instead of a spacelike curvature singularity. The metric can be ana- 
lytically extended across this horizon to produce a maximal spacetime which 
contains timelike singularities. (The spacelike singularity in region I in Fig. 
10 is changed into the horizon, to the future of which timelike singularities 
occur.) In the region of interaction of the colliding waves, the spacetime is 
isometric to the spacetime in a region interior to the ergosphere. 

Many new interesting solutions were discovered by using the Khan and 
Penrose approach. In addition, inverse scattering (soliton) methods and other 
tools from the solution generation techniques were applied. They are reviewed 
in detail in [65,212,244]. 

Although very attractive mathematical methods are contained in these 
works, one feels that physical interpretation has receded into the background 
- as seemed to be the case when the new solution generating techniques were 
exploited in all possible directions for stationary axisymmetric spacetimes. It 
is therefore encouraging that a more physical and original approach to the 
problem has been initiated by Yurtsever. In a couple of papers he discusses 
Killing-Cauchy horizons [245] and the structure of the singularities produced 
by colliding plane waves [236]. Similar to the Cauchy horizons in black hole 
physics, one finds that the Killing-Cauchy horizons are unstable. We thus 
expect that the horizon will be converted to a spacelike singularity. By using 
the approach of Szekeres described at the beginning of this section, it is pos- 
sible to relate the asymptotic form of the metric near the singularity - which 
approaches an inhomogeneous Kasner solution (see Sect. 12.1) - to the initial 
data given along the wavefronts of the incoming waves. For specific choices 
of initial data the singularity degenerates into a coordinate singularity and a 
Killing-Cauchy horizon arises. However, Yurtsever ’s analysis [236] shows that 
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such horizons are unstable (within full nonlinear theory) against small but 
generic perturbations of the initial data. These results are stronger than those 
on the instability of the inner horizons of the Reissner-Nordstrom or Kerr 
black holes. In particular, Yurtsever constructs an interesting (though un- 
stable) solution which, when analytically extended across its Killing-Cauchy 
horizon, represents a Schwarzschild black hole created out of the collision be- 
tween two plane sandwich waves propagating in a cylindrical universe [236] . 

Yurtsever also introduced “almost plane wave spacetimes” and analyzed 
collisions of almost plane waves [246]. These waves have a finite but very 
large transverse sizes. Some general results can be proved (for example, that 
almost plane waves cannot have a sandwich character, but always leave tails 
behind), and an order-of-magnitude analysis can be used in the discussion of 
the outcome of the collision of two almost plane waves; i.e. whether they will 
focus to a finite minimum size and then disperse, or whether a black hole 
will be created. Although in the case of almost plane waves one can hardly 
hope to find an exact spacetime in an explicit form, this is a field which was 
inspired by exact explicit solutions, and may play a significant role in other 
parts of general relativity. 

9 Cylindrical Waves 

In 1913, before the final formulation of general relativity, Einstein remarked 
in a discussion with Max Born that, in the weak- field limit, gravitational 
waves exist and propagate with the velocity of light (Poincare pioneered the 
idea of gravitational waves propagating with the velocity of light in 1905 - 
see [15]). Yet, in 1936 Einstein wrote to Born [247]: “... gravitational waves 
do not exist, though they had been assumed a certainty to the first approxi- 
mation. This shows the nonlinear general relativistic field equations can tell 
us more, or, rather, limit us more than we have believed up to now. If only 
it were not so damnably difficult to find rigorous solutions”. However, after 
finding a mistake in his argumentation (with the help of H. Robertson) and 
discovering with Nathan Rosen cylindrical gravitational waves [248] as the 
first exact radiative solutions to his vacuum field equations, Einstein changed 
his mind. In fact, cylindrical waves were found more than 10 years before Ein- 
stein and Rosen by Guido Beck in Vienna [249]. Beck was mainly interested 
in time-independent axisymmetric Weyl fields, but he realized that through 
a complex transformation of coordinates {z it, t ^ iz) one obtains cylin- 
drically symmetric time-dependent fields which represent cylindrical gravi- 
tational waves, and wrote down equations (71) and (72) below. The work 
of Einstein and Rosen is devoted explicitly to gravitational waves. It investi- 
gates conditions for the existence of standing and progressive waves, and even 
notices that the waves carry away energy from the mass located at the axis 
of symmetry. We shall thus not modify the tradition and will call this type 
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of waves Einstein-Rosen waves (which some readers may wish to shorten to 
EROS- waves). 

This type of waves, symmetric with respect to the transformation z — >■ —2 
{z - the axis of symmetry), contains one degree of freedom of the radiation 
field and corresponds to a fixed state of polarization. The metric can be 
written in the form 

ds^ = + dp^) + e'^'^dz^ + p^e-^^dip^, (65) 

where p and t are invariants (“Weyl-type canonical coordinates”), and ip = 
ip{t,p), 7 = 7 (t,p). The Killing vectors d/d(p and d/dz are both spacelike 
and hypersurface orthogonal. 

The metric containing a second degree of freedom was discovered by 
Jurgen Ehlers (working in the group of Pascual Jordan), who used a trick 
similar to Beck’s on the generalized (stationary) Weyl metrics, and indepen- 
dently by Kompaneets (see the discussion in [250]). In the literature (e.g. 
[251,252]) one refers to the Jordan-Ehlers-Kompaneets form of the metric: 

ds^ = [—dt^ + dp^) + {dz + Lodpif' + p'^ dtp'^ . (66) 

Here, the additional function Lo{t, p) represents the second polarization. 

Despite the fact that cylindrically symmetric waves cannot describe ex- 
actly the radiation from bounded sources, both the Einstein-Rosen waves and 
their generalization ( 66 ) have played an important role in clarifying a num- 
ber of complicated issues, such as the energy loss due to gravitational waves 
[253], the interaction of waves with cosmic strings [254,255], the asymptotic 
structure of radiative spacetimes [250], the dispersion of waves [256], testing 
the quasilocal mass-energy [257], testing codes in numerical relativity [251], 
investigation of the cosmic censorship [258] , and quantum gravity in a simpli- 
fied but field theoretically interesting context of midisuperspaces [259-261] . 

In the following we shall discuss in some detail the asymptotic structure 
and midisuperspace quantization since in these two issues cylindrical waves 
have played the pioneering role. Some other applications of cylindrical waves 
will be briefly mentioned at the end of the section. 

9.1 Cylindrical Waves and the Asymptotic Structure 
of 3-Dimensional General Relativity 

In recent work with Ashtekar and Schmidt [262,263], which started thanks 
to the hospitality of Jurgen Ehlers’ group, we considered gravitational waves 
with a space-translation Killing field (“generalized Einstein-Rosen waves”). 
In (2-|-l)-dimensional framework the Einstein-Rosen subclass forms a sim- 
ple instructive example of explicitly given spacetimes which admit a smooth 
global null (and timelike) infinity even for strong initial data. Because of the 
symmetry, the 4-dimensional Einstein vacuum equations are equivalent to 
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the 3-dimensional Einstein equations with certain matter sources. This re- 
sult has roots in the classical paper by Jordan, Ehlers and Kundt [57] which 
includes “reduction formulas” for the calculation of the Riemann tensor of 
spaces which admit an Abelian isometry group. 

Vacuum spacetimes which admit a spacelike, hypersurface orthogonal 
Killing vector djdz can be described conveniently in coordinates adapted 
to the symmetry: 

-I- 3ab(a:)(ix“da;^, a, 6, . . . = 0, 1, 2, (67) 

where x = x“ and gab is a metric with Lorentz signature. The field equations 
can be simplified if one uses a metric in the 3-space which is rescaled by the 
norm of the Killing vector, and writes the norm of the Killing vector as an 
exponential. Then (67) becomes 

ds2 ^ + e-2’^(^)(?„^,(x)dx“dx^ (68) 

and the field equations, 

Rab - = 0, g‘^^VaVb^ = 0, (69) 

where V denotes the derivative with respect to the metric gab, can be rein- 
terpreted as Einstein’s equations in 3 dimensions with a scalar field = '/2tp 
as source. Thus, 4-dimensional vacuum gravity is equivalent to 3-dimensional 
gravity coupled to a scalar field. In 3 dimensions, there is no gravitational 
radiation. Hence, the local degrees of freedom are all contained in the scalar 
field. One therefore expects that Cauchy data for the scalar field will suffice to 
determine the solution. For data which fall off appropriately, we thus expect 
the 3-dimensional Lorentzian geometry to be asymptotically flat in the sense 
of Penrose [27,264], i.e. that there should exist a 2-dimensional boundary 
representing null infinity. 

In general cases, this is analyzed in [262] . Here we shall restrict ourselves 
to the Einstein-Rosen waves by assuming that there is a further spacelike, 
hypersurface orthogonal Killing vector d/d(p which commutes with djdz. 
Then, as is well known, the equations simplify drastically. The 3-metric is 
given by 

= gabdx°‘dx^ = e^~*{—dt^ + dp^) + p^dip^, (70) 

the field equations (69) become 

7' = + 7 = 2pT/>i/;', (71) 

and 

+ + = (72) 

where the dot and the prime denote derivatives with respect to t and p 
respectively. The last equation is the wave equation for the non-flat 3-metric 
(70) as well as for the flat metric obtained by setting 7 = 0. 
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Thus, we can first solve the axisymmetric wave equation (72) for 'ip on 
Minkowski space and then solve (71) for 7 - the only unknown metric coeffi- 
cient ~ by quadratures. The “method of descent” from the Kirchhoff formula 
in 4 dimensions gives the representation of the solution of the wave equation in 
3 dimensions in terms of Cauchy data 'I'q = tp{t = 0, x, y), = 0, x, y) 

(see [262]). We assume that the Cauchy data are axially symmetric and of 
compact support. 

Let us look at the behaviour of the solution at future null infinity . 
Let p, ip be polar coordinates in the plane, and introduce the retarded time 
coordinate u = t—p to explore the fall-off along constant u null hypersurfaces. 
For large p, the function ip at u = constant admits a power series expansion 
in p~^: 



ip{u,p) 



/o(^) ]_ fkju) 

'/P '/Ph'i 



(73) 



The coefficients in this expansion are determined by integrals over the Cauchy 
data. At u ^ po, po being the radius of the disk in the initial Cauchy surface 
in which the data are non-zero, we obtain 



fo{u) — — 7 H r + • ■ ■ ) (74) 

where kg , fci are constants which are determined by the data. If the solution 
happens to be time-symmetric, so that vanishes, we find /o ^ u~^ for 
large u. Similarly, we can also study the behaviour of the solution near the 
timelike infinity of 3-dimensional Minkowski space by setting t = U + up, 
K > 1, and investigating '0 for p — >■ 00 with U and n fixed. We refer to [262] 
for details. 

In Bondi-type coordinates {u = t — p,p,<p), equation (70) yields 

da'^ = e^'^{—du^ — 2dudp) + p^dip^. (75) 

The Einstein equations take the form 

7 .U = 2p '0.u(0,p - V',u), 7 .p = PV'^p, (76) 

and the wave equation on ip becomes 

-2ip^up + '<P,pp + P~^{tp,p-'<p,u) = ^- (77) 



The asymptotic form of ip{t,p) is given by the expansion (73). Since we can 
differentiate (73) term by term, the field equations (76) and (77) imply 



1,U = -2[fo(u)]^ + Y^ 



9k{u) 



(78) 



7.P = 



hk{u) 
„k+2 ’ 
fe =0 ^ 



(79) 
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where the functions fk,hk are products of the functions /o,/fe,/o)/fc- Inte- 
grating (79) and fixing the arbitrary function of u in the result using (78), 
we obtain 



7 = 7o - 2 



r 1 2 ^ 

fo{u) du~Y^ 

-oo , 1 



hkiu) 



(fc -f l)p'=+i ■ 



(80) 



Thus, 7 also admits an expansion in where the coefficients depend 

smoothly on u. It is now straightforward to show that the spacetime ad- 
mits a smooth future null infinity, Setting p = p~^,u = u,(p = ip and 
rescaling pab by a conformal factor = p, we obtain 



da^ = Q'^dcr^ = {—p^du^ + 2dudp) + dip^ , (81) 



where 7 (u, p) = 'y{u, p~^). Because of (80), 7 has a smooth extension through 
p = 0. Therefore, Pab is smooth across the surface p = 0. This surface is the 
future null infinity, . Hence, the (2-|-l)-dimensional curved spacetime has 
a smooth (2-dimensional) null infinity. Penrose’s picture works for arbitrarily 
strong initial data tf'o, 

Using (81), we find that at we have: 



7 (m, 00 ) = 7 o - 2 / f^du. (82) 



Since one can make sure that 7 = 0 at i"*" [263], one finds the simple result 
that 



f* + 00 



7o = 2 



fodu. 



(83) 



At spatial infinity (t = constant, p — ^ 00 ), the metric is given by 

{-df + dp^) + p^dp^. (84) 



For a non-zero data, constant 70 is positive, whence the metric has a “coni- 
cal singularity” at spatial infinity. This conical singularity, present at spatial 
infinity, is “radiated out” according to equation (82). The future timelike in- 
finity, i’*', is smooth. In (2-|-l)-dimensions, modulo some subtleties [262], equa- 
tion (82) plays the role of the Bondi mass loss formula in (3-|-l)-dimensions, 
relating the decrease of the total (Bondi) mass-energy at null infinity to the 
flux of gravitational radiation. We can thus conclude that cylindrical waves 
in (2+l)-dimensions pive an explicit model of the Bondi-Penrose radiation 
theory which admits smooth null and timelike infinity for arbitrarily stronp 
initial data. There is no other such model available. The general results on 
the existence of in 4 dimensions, mentioned at the end of Sect. 1.3, assume 
weak data. 

It is of interest to investigate cylindrical waves also in a (3-|-l)-dimensional 
context. The asymptotic behaviour of these waves was discussed by Stachel 
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[250] many years ago. However, his work deals solely with asymptotic direc- 
tions, which are perpendicular to the axis of symmetry, i.e. to the djdz - 
Killing vector. Detailed calculations show that, in contrast to the perpendic- 
ular directions, where null infinity in the (3-|-l)-dimensional framework does 
not exist, it does exist in other directions for data of compact support. If the 
data are not time-symmetric, the fall-off is so slow that (local) null infinity 
has a polyhomogeneous (logarithmic) character [180] - see [263] for details. 

We have concentrated on the simplest case of Einstein-Rosen waves. They 
served as a prototype for developing a general framework to analyze the 
asymptotic structure of spacetime at null infinity in three spacetime dimen- 
sions. This structure has a number of quite surprising features which do not 
arise in the Bondi-Penrose description in four dimensions [262]. One of the 
motivations for developing such a framework is to provide a natural point 
of departure for constructing the stage for asymptotic quantization and the 
S-matrix theory of an interesting midisuperspace in quantum gravity. 

9.2 Cylindrical Waves and Quantum Gravity 

As the editors of the Proceedings of the 117th WE Heraeus Seminar on 
canonical gravity in 1993 [265], Jurgen Ehlers and Helmut Friedrich start 
their Introduction realistically: “When asking a worker in the field about the 
progress in quantum general relativity in the last decade, one shouldn’t be 
surprised to hear: ‘We understand the problems better’. If it referred to a 
lesser task, such an answer would sound ironic. But the search for quantum 
gravity... has been going on now for more than half a century and in spite 
of a number of ingenious proposals, a satisfactory theory is still lacking...” 
Although I am following the subject from afar, I believe that one would 
not be too wrong if one repeated the same words in 1999. However, apart 
from general theoretical developments, many interesting quantum gravity 
models have been studied, and exact solutions have played a basic role in 
them. In particular, in the investigations of (spherical) gravitational collapse 
and in quantum cosmology based typically on homogeneous cosmological 
models (cf. Sect. 12.1), one starts from simple classical solutions - see e.g. 
[266-268] for reviews and [269] for a bibliography up to 1990. A common 
feature of such models is the reduction of infinitely many degrees of freedom 
of the gravitational field to a finite number. In quantum field theory (such 
as quantum electrodynamics) a typical object to be quantized is a wave with 
an infinite number of degrees of freedom. The first radiative solutions of 
the gravitational field equations which were subject to quantization were 
the Einstein-Rosen waves. Kuchaf [259] applied the methods of canonical 
quantization of gravity to these waves, using the methods employed earlier in 
the minisuperspace models, i.e. restricting himself only to geometries (fields) 
preserving the symmetries. 

The Einstein-Rosen cylindrical waves have an infinite number oo^ of de- 
grees of freedom contained in one polarization, one degree of freedom for each 
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cylindrical surface drawn around the axis of symmetry. Moreover, the slicing 
of spacetime by spacelike (cylindrically symmetric) hypersurfaces is not fixed 
completely by the symmetry ~ an arbitrary cylindrically symmetric defor- 
mation of a given slice leads again to an allowed slice. Such a deformation 
represents an oo^ “fingered time”. Hence, the resulting space of 3-geometries 
on cylindrically symmetric slices is infinitely richer than the minisuperspaces 
of quantum cosmology. The exact Einstein-Rosen waves thus inspired the 
first construction of what Kuchaf [259] called the ''''midisuperspace''. 

Let us briefly look at the main steps in Kuchaf’s procedure. The sym- 
metry of the problem implies that the spatial metric has the form 

9 ii = 922 = 933 = e'^, (85) 

where 7, and R are functions of a single cylindrical coordinate = r = 
(/?, = z). Similarly the lapse function N = N{r) depends only on r, and the 

shift vector has the only nonvanishing radial component (r) , N'^ = 

= 0. We have adopted here Kuchaf’s notation. When we put R = r = 
p, = 2 ^/), 7 — >■ 27, N = and = 0, we recover the standard 

Einstein-Rosen line element (65); however, in general the radial and time 
coordinates t and r differ from the canonical Einstein-Rosen radial and time 
coordinates in which the metric has the standard form (65). The symmetry 
implies that the canonical momentum is diagonal and expressible by three 
functions tt^j, tt^ of r; for example, , and similarly for the 

other components. After the reduction to cylindrical symmetry, the action 
functional assumes the canonical form 

/ OO ptX) 

dt / dr{7T.y'y + 7T/ji? -|- — NTL — N^TLi) , (86) 

-00 J 0 

in which 'y,R,(l> are the canonical coordinates and tt.^, tt/j, tt^ the conjugate 
momenta (the integration over 2 has been limited by 2 ; = zq £md z = zq + 1 ). 
The superhamiltonian R and supermomentum Ri are rather complicated 
functions of the canonical variables: 

R {-TT.yTTR+\R-^T:l + 2R'' -iR' +\R^'^), (87) 

Ri = — 27 t(^ -I- 7'TT.y -I- R'-kr + (88) 

The most important step now is the replacement of the old canonical 
variables 7, TT.y, R, ttr by a new canonical set T, Ur, R, Ur through a suitable 
canonical transformation. We shall write here only one of its components (see 
[259,270] for the complete transformation): 

T{r)=T{oo)+ f [—TT.y{r)]dr. (89) 

J 00 

For the basic concepts and ideas of canonical gravity, we refer to e.g. [18,19] and 
especially to Kuchaf’s review [270], where the canonical quantization of cylindri- 
cal waves is also analyzed. 
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By integrating the Hamilton equations following from the action (86), rewrit- 
ten in the new canonical coordinates, one finds that T and R are the Einstein- 
Rosen privileged time and radial coordinates, i.e. those appearing in the 
canonical form (65) of the Einstein-Rosen metric (with T = t,R = p). Ac- 
cording to (89), the Einstein-Rosen time can be reconstructed, in a non-local 
way, from the momentum which characterizes the extrinsic curvature of 
a given hypersurface. In this way, the concept of the “extrinsic time repre- 
sentation” entered canonical gravity with cylindrical gravitational waves. 

In terms of the new canonical variables, the superhamiltonian and super- 
momentum become 



Since R and Ri are linear in Ut and 7T/j, the classical constraints R = 
0,Hi = 0 can immediately be resolved with respect to these momenta, con- 
jugate to the “embedding” canonical variables T(r) and i?(r): 



-Rt = {R'^ - T'2) ^ [l(R-\l + R^'^)R' - = 0, (92) 



and similarly for 17^. It is easy to see [259,270] that the constraints have the 
same form as the constraints for a massless scalar field <7> propagating on a flat 
background foliated by arbitrary spacelike hypersurfaces T = T(r), R = R{r). 
The canonical variables 7>, represent the true degrees of freedom, and 
the remaining canonical variables play the role of spacelike embeddings of a 
Cauchy hypersurface into spacetime. 

After turning the canonical momenta TJt, Rr, into variational deriva- 
tives, e.g. Rt = —iS/5T{r), one can impose the classical constraints R = 
0,Ri = 0 as restrictions on the state functional W{T,R,<7): RW = 0, Ri'R = 
0. In particular, the Wheeler-DeWitt equation RW = 0 in the extrinsic time 
representation assumes the form of a many-fingered time counterpart of an or- 
dinary Schrodinger equation. This reduces to the ordinary Schrodinger equa- 
tion for a single massless scalar field in Minkowski space if we adopt the 
standard foliation T = constant (see [259,270] for details). 

The described procedure, first realized in the case of the Einstein-Rosen 
waves, has opened a new route in canonical and quantum gravity. In contrast 
to the Arnowitt-Deser-Misner approach, in which the gravitational dynamics 
is described relative to a fixed foliation of spacetime, in this new approach 
(called “bubble time” dynamics of the gravitational field or the “internal time 
formalism” [271]) one tries to extract the many-fingered time (i.e. embeddings 
of Cauchy hypersurfaces) from the gravitational phase space, but does not fix 
the foliation in the “target manifold” by coordinate conditions. However, the 
definition of the target manifold by a gauge (coordinate) condition is needed. 

This new approach has been so far successfully applied to a few other 
models (based on exact solutions) with infinite degrees of freedom, for ex- 
ample, plane gravitational waves, bosonic string, and as late as 1994, to 



R = {R'Rt + T'Rr + i -k R^'^)) 

•Hi = T'Rt + R'Rr + ^<'7r<z,. 



(90) 

(91) 
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spherically symmetric vacuum gravitational fields [272]. The internal time 
formalism for spacetimes with two Killing vectors was developed in [273] 
(therein references to previous works can also be found). Recently, canonical 
transformation techniques have been applied to Hamiltonian spacetime dy- 
namics with a thin spherical null-dust shell [274] . One would like to construct 
a midisuperspace model of spherical gravitational collapse, or more specifi- 
cally, a model for Hawking radiation with backreaction. The extensive past 
work on Hamiltonian approaches to spherically symmetric geometries (see 
[274] for more than 40 references in this context) have not yet led to con- 
vincing insights. The very basic question of existence of the “internal time” 
formalism in a general situation has been most recently addressed by Hajicek 
[49]; the existence has been proven, and shown to be related to the choice of 
gauge. 

9.3 Cylindrical Waves: a Miscellany 

Chandrasekhar [247] constructed a formalism for cylindrical waves with two 
polarizations (cf. the metric (66)), similar to that used for the discussion of 
the collision of plane-fronted waves (Sect. 8.3). He obtained the “cylindrical” 
Ernst equation and corroborated (following the suggestion of O. Reula) the 
physical meaning of Thorne’s C- energy [253] - the expression for energy sug- 
gested for cylindrical fields - by defining a Hamiltonian density corresponding 
to the Lagrangian density from which the Ernst equation can be derived. A 
brief summary of older work on the mass loss of a cylindrical source radi- 
ating out cylindrical waves and its relation to the C-energy is given in [65]. 
It should be pointed out, however, that although C-energy is a useful quan- 
tity, it was constructed by exploiting the local field equations, without direct 
reference to asymptotics. The physical energy (per unit z length) at both 
spatial and null infinity, which is the generator of the time translation, is in 
fact a non-polynomial function of the C-energy. In the weak field limit the 
two agree, but in strong fields they are quite different [262]. 

In [256], an exact solution was constructed with which one can study the 
dispersion of waves: a cylindrical wave packet, which though initially impul- 
sive, after reflection at the axis disperses, and develops shock wave fronts 
when the original wave meets the waves that are still ingoing. Cylindrical 
waves have been also analyzed in the context of phase shifts occurring in 
gravitational soliton interactions (see [275] and references therein). 

An exact explicit solution for cylindrical waves with two degrees of polar- 
ization has been obtained [252] from the Kerr solution after transforming the 
metric into “cylindrical” coordinates and using the substitution t ^ iz,z ^ 
it, a id. Both this solution and the well-known Weber- Wheeler-Bonnor 
pulse [65] have been employed as test beds in numerical relativity [276], in 
particular in the approach which combines a Cauchy code for determining 
the dynamics of the central source with a characteristic code for determining 
the behaviour of radiation [251]. 
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In a number of works cylindrical waves have beeir considered iir interaction 
with cosmic strings [254,255]. The striirgs are usually modelled as iirfiiritely 
thin conical siirgularities. Receirtly Colombeau’s theory of geireralized func- 
tions was used to calculate the distributional curvature at the axis for a 
time-dependent cosmic string [277]. 

A somewhat surprising result concerning cosmic strings and radiation the- 
ory should also be noted: although an infinite, static cylindrically symmet- 
ric string does not, of course, radiate, it generates a nonvanishing (though 
“non-radiative” ) contribution to the Bondi news function [278,279]. Recently, 
the asymptotics at null infinity of cylindrical waves with both polarizations 
(and, in geireral, air infinite cosmic string along the axis) has been analyzed 
in the context of axisymmetric electrovacuum spacetimes with a translational 
Killing vector at null infinity [280] . 

Finally, the cylindrically symmetric electrovacuum spacetimes with both 
polarizations, satisfying certain completeness and asymptotic flatness condi- 
tions in spacelike directions have been shown rigorously to imply that strong 
cosmic censorship holds [258]. This means that for generic (smooth) initial 
data the maximal globally hyperbolic development of the data is inextendible 
(no Cauchy horizons as for example, those discussed in Sect. 3.1 for the 
Reissner-Nordstrom spacetime arise). This global existence result is non- 
trivial siirce with two polarizations and electromagnetic field present, all field 
equations are ironliirear. 

10 On the Robinson— Trautman Solntions 

Robiirsoir-Trautman metrics are the geireral radiative vacuum solutions which 
admit a geodesic, shearfree and twistfree null congruence of diverging rays. 
In the standard coordinates the metric has the form [281] 

ds^ = 2r^ P~^ dCd( — 2du dr — [A hr P — 2r(ln P)_„ — 2mr~^] du^, (93) 

where C is a complex spatial (stereographic) coordinate (essentially 9 and 
ip), r is the affine parameter along the rays, m is a retarded time, m is a 
function of u (which can be in some cases interpreted as the mass of the 
system), A = 2P'^{d‘^ /d(dQ, and P = P{uX,() satisfies the fourth-order 
Robinson-Trautman equation 

AA{lnP) -|- 12 m (lnP)^„ — = 0. (94) 

The best candidates for describing radiation from isolated sources are the 
Robinson-Trautman metrics of type II with the 2-surfaces given by rt, r = 
constant and having spherical topology. The Gaussian curvature of can 
be expressed as K = Z\lnP. If K = constant, we obtain the Schwarzschild 
solution with mass equal to K~ 2 . 

These spacetimes have attracted increased attention in the last decade - 
most recently in the work by Chrusciel, and Chrusciel and Singleton [282] . In 
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these studies the Robinson- Trautman spacetimes have been shown to ex- 
ist globally for all positive “times”, and to converge asymptotically to a 
Schwarzschild metric. Interestingly, the extension of these spacetimes across 
the “Schwarzschild-like” event horizon can only be made with a finite degree 
of smoothness. All these rigorous studies are based on the derivation and 
analysis of an asymptotic expansion describing the long-time behaviour of 
the solutions of the nonlinear parabolic equation (94). 



r= oo r=0 /+ r= oo 




Fig. 11. The evolution of the cosmological Robinson-Trautman solutions with a 
positive cosmological constant. A black hole with the horizon is formed; at 
future infinity the spacetime approaches a de Sitter spacetime exponentially 
fast, in accordance with the cosmic no-hair conjecture. 



In our recent work [163,283] we studied Robinson-Trautman radiative 
spacetimes with a positive cosmological constant A. The results proving the 
global existence and convergence of the solutions of the Robinson-Trautman 
equation (94) can be taken over from the previous studies since A does not 
explicitly enter this equation. We have shown that, starting with arbitrary, 
smooth initial data at u = Ug (see Fig. 11), these cosmological Robinson- 
Trautman solutions converge exponentially fast to a Schwarzschild-de Sitter 
solution at large retarded times (u — ^ oo). The interior of a Schwarzschild- 
de Sitter black hole can be joined to an “external” cosmological Robinson- 
Trautman spacetime across the horizon with a higher degree of smooth- 
ness than in the corresponding case with A = 0. In particular, in the extreme 
case with 9Am^ = 1, in which the black hole and cosmological horizons coin- 
cide, the Robinson-Trautman spacetimes can be extended smoothly through 
TL^ to the extreme Schwarzschild-de Sitter spacetime with the same values 
of A and m. However, such an extension is not analytic (and not unique). 

We have also demonstrated that the cosmological Robinson-Trautman 
solutions represent explicit models exhibiting the cosmic no-hair conjecture: 
a geodesic observer outside of the black hole horizon will see, that inside 
his past light cone, these spacetimes approach the de Sitter spacetime ex- 



Jin Bicak 



ponentially fast as he approaches the future (spacelike) infinity For a 
freely falling observer the observable universe thus becomes quite bald. This 
is what the cosmic no-hair conjecture claims. As far as we are aware, these 
models represent the only exact analytic demonstration of the cosmic no-hair 
conjecture under the presence of gravitational waves. They also appear to be 
the only exact examples of black hole formation in nonspherical spacetimes 
which are not asymptotically fiat. Hopefully, these models may serve as tests 
of various approximation methods, and as test beds in numerical studies of 
more realistic situations in cosmology. 



11 The Boost— Rotation Symmetric 
Radiative Spacetimes 

In this section we would like to describe briefiy the only explicit solutions 
available today which are radiative and represent the fields of finite sources. 
Needless to say, we cannot hope to find explicit analytic solutions of the 
Einstein equations without imposing a symmetry. A natural first assumption 
is axial symmetry, i.e. the existence of a spacelike rotational Killing vector 
djdLp. However, it appears hopeless to search for a radiative solution with 
only one symmetry. We are now not interested in colliding plane waves since 
these do not represent finite sources; we wish our spacetime to be as “asymp- 
totically fiat as possible”. The unique role of the boost-rotation symmetric 
spacetimes is exhibited by a theorem, formulated precisely and proved for 
the vacuum case with hypersurface orthogonal Killing vectors in [284], and 
generalized to electrovacuum spacetimes with Killing vectors which need not 
be hypersurface orthogonal in [279] (see also references therein). This the- 
orem roughly states that in axially symmetric, locally asymptotically fiat 
spacetimes (in the sense that a null infinity satisfying Penrose’s requirements 
exists, but it need not necessarily exist globally), the only additional symme- 
try that does not exclude radiation is the boost symmetry. 

In Minkowski spacetime the boost Killing vector has the form 

d d 

Cboost = Z— + ! ( 95 ) 

SO that orbits of symmetry to which the Killing vector is tangent are hy- 
perbolas — t^ = B = constant, x, y = constant. Orbits with B > 0 are 
timelike; they can represent worldlines of uniformly accelerated particles in 
special relativity. Imagine, for example, a charged particle, axially symmet- 
ric about the z-axis, moving with a uniform acceleration along this axis. 
The electromagnetic field produced by such a source will have boost-rotation 
symmetry. 

Figure 12 shows two particles uniformly accelerated in opposite direc- 
tions along the z-axis. In the space diagram (left), the “string” connecting 
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the particles is also indicated. In the spacetime diagram, the particles’ world- 
lines are shown in bold. Thinner hyperbolas represent the orbits of the boost 
Killing vector (95) in the regions > z'^ where it is spacelike. In Fig. 13 the 
corresponding compactified diagram indicates that null infinity cannot be 
smooth everywhere since it contains four singular points in which particles’ 
worldlines “start” and “end”. Notice that in electromagnetism the presence 
of two particles, one moving along ^ > 0, the other along z < 0, makes the 
field symmetric also with respect to inversion z — ^ — z. The electromagnetic 
field can be shown to be analytic everywhere, except for the places where the 
particles occur. These two particles move independently of each other, since 
their worldlines are divided by two null hypersurfaces z = t, z = —t. This is 
analogous to the boost-rotation symmetric spacetimes in general relativity 
that we are now going to discuss. 

Specific examples of solutions representing “uniformly accelerated parti- 
cles” have been analyzed for 35 years, starting with the first solutions of this 
type obtained by Bonnor and Swaminarayan [285] , and Israel and Khan [286] . 
In a curved spacetime the “uniform acceleration” is understood with respect 
to a fictitious Minkowski background, and the “particles” mean singularities 
or black holes. For a more extensive description of the history of these spe- 
cific solutions discovered before 1985, see [287]. From a unified point of view, 
boost-rotation symmetric spacetimes (with hypersurface orthogonal Killing 
vectors) were defined and treated geometrically in [288]. We refer to this de- 
tailed work for rigorous definitions and theorems. Here we shall only sketch 
some of the general properties and some applications of these spacetimes. 

The metric of a general boost-rotation symmetric spacetime in “Cartesian- 
type” coordinates {t, x, y, z} reads: 

ds^ = — 2 + e~^y^)dx^ + 2xy{e^ — e~^)dxdy] + 

X y 

+ 2 ^ + er^x^)dy'^ + ^ (e^z^ - e^t^)dz^ - 

x^ -I- y z^ — 

7 ^ — ^ \2zt(e^ — e^)dzdt + (e^z^ — e^t^)dt^] , (96) 

z^ — 

where y and A are functions of = x'^ + y^ and As a consequence 

of the vacuum Einstein equations, the function y must satisfy an equation 
of the form which is identical to the flat-space wave equation; and function 
A is determined in terms of y by quadrature. Now it can easily be seen that 
the metric (96) admits axial and boost Killing vectors which have exactly the 
same form as in Minkowski space, i.e. the axial Killing vector d/dip and the 
boost Killing vector (95). In fact, the whole structure of group orbits in boost- 
rotation symmetric curved spacetimes outside the sources (or singularities) 
is the same as the structure of the orbits generated by the axial and boost 
Killing vectors in Minkowski space. In particular, the boost Killing vector 
(95) is timelike in the region z^ > . The invariance of a metric (or of 
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any other field) in a time-direction (determined in a coordinate-free manner 
by a timelike Killing vector) means stationarity, and of course, we could 
hardly expect to find radiative properties there. Intuitively, the existence of 
a timelike Killing vector in the region is understandable because 

there (generalized) uniformly accelerated reference frames can be introduced 
in which sources are at rest, and the fields are time independent. 




Fig. 12. Two particles uniformly accelerated in opposite directions. The orbits of 
the boost Killing vector (thinner hyperbolas) are spacelike in the region . 



However, in the other “half” of the spacetime, the boost Killing 

vector (95) is spacelike (see the lines representing orbits of the boost Killing 
vector in Fig. 12). Hence in this region the metric (96) is nonstationary. 
Here we expect to discover radiative properties. Indeed, it can be shown that 
for > z'^ + the metric (96) can locally be transformed into the metric 
of Einstein-Rosen cylindrical waves. Although locally in the whole region 
the metric (96) can be transformed into a radiative metric, the global 
properties of the boost-rotation symmetric solutions are quite different from 
those of cylindrical waves. Again, we have to refer to the work [288] for a 
detailed analysis. Let us only say that the boost-rotation symmetric solutions, 
if properly defined - with appropriate boundary conditions on functions A and 
p, - always admit asymptotically flat null infinity at least locally. Starting 
with arbitrary solutions A and p, and adding suitable constants to both A and 
p (Einstein’s equations are then still satisfied), we can always guarantee that 
even global J exists in the sense that it admits smooth spherical sections. 
For the special type of solutions for A and p, complete J satisfies Penrose’s 
requirements, except for four points in which the sources “start” and “end” 
(cf. Fig. 13). In all cases one finds that the gravitational field in smooth 
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regions of the null infinity is radiative [279,289]. In particular, the leading 
term of the Riemann curvature tensor, proportional to r~^ (where = 
) , is nonvanishing and has the same algebraic structure as the Riemann 
tensor of plane waves. This is fully analogous to the asymptotic properties 
of radiative electromagnetic fields outside finite sources. Recently, general 
forms of the news functions have been obtained for electrovacuum spacetimes 
with boost-rotation symmetry and with Killing vectors which need not be 
hypersurface orthogonal [279]. 




Fig. 13. The Penrose compactified diagram of a boost-rotation symmetric space- 
time. Null infinity can admit smooth sections. 



It is well known that in general relativity the “causes” of motion are always 
incorporated in the theory - in contrast to electrodynamics where they need 
not even be describable by Maxwell’s theory. In a general case of the boost- 
rotation symmetric solutions there exist nodal (conical) singularities of the 
metric distributed along the z-axis which can be considered as “strings” , and 
cause particles to accelerate. They reveal themselves also at J . However, the 
distribution of nodes can always be arranged in such a manner that admits 
smooth regular sections as mentioned above. 

In exceptional cases, when is regular except for four points, either 
the particles are “self-accelerating” due to their “inner” multipole structure, 
which has to include a negative mass; or there are more particles distributed 
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along z > 0 (and symmetrically along z < 0) with the signs and the magni- 
tudes of their masses and acceleratioirs choseir appropriately. (For the concept 
of a negative mass in general relativity, and the first discussion of a “chas- 
iirg” pair of a positive aird a iregative mass particle, see classical papers by 
Bondi, and Bonnor and Swaminarayan [285].) An infinite number of differ- 
ent analytic solutions representing self-accelerating particles was constructed 
explicitly [290]. Although a negative mass cannot be bought easily iir the 
shop (as Bondi liked to say), these solutions are the only exact solutions of 
Einstein’s equations available today for which one can find such quantities 
of physical interest as radiation patterns (angular distribution of gravita- 
tional radiation), or total radiatioir powers [287]. From a mathematical poiirt 
of view, these solutions represeirt the only knowir spacetimes in which ar- 
bitrarily strong (boost-rotation symmetric) initial data can be choseir on a 
hyperboloidal hypersurface in the region t^ > z^, which will lead to a com- 
plete smooth null infinity and a regular timelike future infinity. With these 
specific examples one thus does not have to require weak-field initial data 
as one has to in the work of Friedrich, and Christodoulou and Klainerman, 
mentioned at the end of Sect. 1.3. 

The boost-rotation symmetric radiative spacetimes can be used as test 
beds for approximation methods or numerical relativity. Bicak, Reilly and 
Winicour [291] found the explicit boost-rotation symmetric “initial null cone 
solution”, which solves initial hypersurface and evolution equations in “ra- 
diative” coordinates employed in the null coire version of numerical relativity. 
This solution has been used for checking and improving numerical codes for 
computing gravitational radiation from more realistic sources; a new solution 
of this type has also been found [292]. Recently, the specific boost-rotation 
symmetric spacetimes constructed in [290] were used as test beds in the stan- 
dard version of numerical relativity based on spacelike hypersurfaces [293]. 

There exist “generalized” boost-rotation symmetric spacetimes which are 
not asymptotically flat, but are of considerable physical interest. They de- 
scribe accelerated particles in asymptotically “uniform” external fields. One 
can construct such solutions from asymptotically flat boost-rotation symmet- 
ric solutions for the pairs of accelerated particles by a limiting procedure, in 
which one member of the pair is “removed” to infinity, and its mass param- 
eter is simultaneously increased [294]. Since the resulting spacetimes are not 
asymptotically flat, their radiative properties are not easy to analyze. Only 
if the external field is weak will there exist regions in which the spacetimes 
are approximately flat; and here their radiative properties might be investi- 
gated. So far no systematic analysis of these spacetime has been carried out. 
Nevertheless, they appear to offer the best rigorous examples of the motion 
of relativistic objects. No nodal singularities or negative masses are necessary 
to cause an acceleration. 

As an eloquent example of such a spacetime consider a charged (Reissner- 
Nordstrom) black hole with mass M and charge Q, immersed in an electric 
field “uniform at infinity”, characterized by the field-strength parameter E. 
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An exact solution of the Einstein-Maxwell equations exists which describes 
this situation [131]. It goes over into an approximate solution obtained by 
perturbing the charged black hole spacetime by a weak external electric field 
which is uniform at infinity [295]. One of the results of the analysis of this 
solution is very simple: a charged black hole in an electric field starts to 
accelerate according to Newton’s second law, Ma = QE, where all the quan- 
tities can be determined - and in principle measured - in an approximately 
fiat region of the spacetime from the asymptotic form of the metric. Recall 
T. S. Eliot again: “There is only the fight to recover what has been lost / 
And found and lost again and again.” 

These types of generalized boost-rotation symmetric spacetimes ( “gener- 
alized C-metrics”) have been used by Hawking, Horowitz, Ross, and others 
[296] in the context of quantum gravity ~ to describe production of black hole 
pairs in strong background fields. 

Recently, we have studied the spinning C-metric discovered by Plebahski 
and Demiahski [297]. Transformations can be found which bring this metric 
into the canonical form of spacetimes with boost-rotation symmetry [298]. 
The metric represents two uniformly accelerated, rotating black holes, either 
connected by conical singularity, or with conical singularities extending from 
each of them to infinity. The spacetime is radiative. No other spacetime of 
this type, with two Killing vectors which are not hypersurface orthogonal, is 
available in an explicit form. 



12 The Cosmological Models 

In light of Karl Popper’s belief that “all science is cosmology”, it appears 
unnecessary to justify the choice of solutions for this last section. As in the 
whole article, these will be primarily vacuum solutions. On the other hand, 
in light of the light coming from about 10^^ galaxies, each with about 10^^ 
stars, it may seem weird to consider vacuum models of the Universe. Indeed, 
it has become part of the present-day culture that spatially homogeneous 
and isotropic, expanding Friedmann-Robertson- Walker (FRW) models, filled 
with uniformly distributed matter, correspond well to basic observational 
data. In order to achieve a more precise correspondence, it appears sufficient 
to consider just perturbations of these “standard cosmological models”. To 
explain some “improbable” features of these models such as their isotropy 
and homogeneity, one finds an escape in inflationary scenarios. These views 
of a “practical cosmologist” are, for example, embodied in one of the most 
comprehensive recent treatise on physical cosmology by Peebles [28] . 

Theoretical (or mathematical) cosmologists, however, point out that more 
general cosmological models exist which differ significantly from a FRW 
model at early times, approach the FRW model very closely for a certain 
epoch, and may diverge from it again in the future. Clearly, the FRW uni- 
verses represent only a very special class of viable cosmological models. 
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though the simplest and most suitable for interpretations of “fuzzy” cos- 
mological observational data. 

Simple exact solutions play a significant role in the evolution of more 
general models, either as asymptotic or intermediate states. By an “interme- 
diate state” one means the situation when the universe enters and remains 
in a small neighbourhood of a saddle equilibrium point. A simple example 
is the Lemaitre matter- filled, homogeneous and isotropic model with a non- 
vanishing cosmological constant (see e.g. [28]), which expands from a dense 
state (the big bang, or “primeval atom” in Lemaitre’s 1927 terminology), 
passes through a quasistatic epoch in which all parameters are close to those 
of the static Einstein universe (cf. Sect. 1.2), and then the universe expands 
again. An “asymptotic state” means close either to an initial big bang (or 
possibly a final big crunch) singularity, or the situation at late times in for- 
ever expanding universes. It is easy to see that at late times in indefinitely 
expanding universes the matter density decreases, and vacuum solutions may 
become important. However, as we shall discuss below, vacuum models play 
an important role also close to a singularity, when the matter terms in Ein- 
stein’s equations are negligible compared to the “velocity terms” (given by 
the rate of change of scale factors) or to the curvature terms (characteriz- 
ing the curvature of spacelike hypersurfaces). In particular, the pioneering 
(and still controversial) work started at the end of the 1950s by Lifshitz and 
Khalatnikov, and developed later on by Belinsky, Khalatnikov and Lifshitz, 
has shown that the fact that the presence of matter does not influence the 
qualitative behaviour of a cosmological model near a singularity has a very 
general significance (see [299] and [300] for the main original references, and 
[139] for a brief review). 

In gaining an intuition in the analysis of general cosmological singulari- 
ties, the class of spatially homogeneous anisotropic cosmological models have 
played a crucial role. These so called Bianchi models admit a simply transi- 
tive 3-dimensional homogeneity group. Among the Bianchi vacuum models 
there are special exact explicit solutions, in particular the Kasner and the 
Bianchi type II solutions, which exhibit some aspects of general cosmological 
singularities. The Bianchi models have also had an impact on other issues in 
general relativity and cosmology. 

Much work, notably in recent years, has been devoted to the class of 
both vacuum and matter-filled cosmological solutions which are homogeneous 
only on 2-dimensional spacelike orbits. Thus they depend on time and on 
one spatial variable, and can be used to study spatial inhomogeneities as 
density fluctuations or gravitational waves. The vacuum cosmological models 
with two spacelike Killing vectors, sometimes called the Gowdy models , are 

In fact, by Gowdy models, one more often means only the cases with closed group 
orbits, with two commuting spacelike othogonally-transitive Killing vectors (the 
surface elements orthogonal to the group orbits are surface- forming) . 
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interpreted as gravitational waves in an expanding (or contracting) universe 
with compact spatial sections. We shall discuss these two classes separately. 

12.1 Spatially Homogeneous Cosmologies 

The simplest solutions, the Minkowski, de Sitter, and anti de Sitter space- 
times, which have also been used in cosmological contexts (cf. Sect. 1.3), 
are A- dimensionally homogeneous. As noted in Sect. 8.1, the vacuum plane 
waves (see equations (51), (53), (61)) are also homogeneous spacetimes; and 
since they can be suitably sliced by spacelike hypersurfaces with expand- 
ing normal congruence, they can become asymptotic states in homogeneous 
expanding cosmologies. There exist several important non-vacuum homoge- 
neous spacetimes, for example, the Einstein static universe (cf. Sect. 1.2), 
and Godel’s stationary, rotating universe (see e.g. [61,301]), famous for the 
first demonstration that Einstein’s equations with a physically permissible 
matter source are compatible with the existence of closed timelike lines, i.e. 
with the violation of causality. 

Here we shall consider models in which the symmetry group does not make 
spacetime a homogeneous space, but in which each event in spacetime is con- 
tained in a spatial hypersurface that is homogeneous. The standard FRW 
models represent a special case of such models (they admit, in addition, an 
isotropy group 5'0(3) at each point). The general spatially homogeneous so- 
lutions comprise of the Kantowski-Sachs universes and a much wider class 
of Bianchi models. By definition, the Bianchi models admit a simply tran- 
sitive 3-dimensional homogeneity group G3. There exist special “locally ro- 
tationally symmetric” (LRS) Bianchi models which admit a 4-dimensional 
isometry group G4 acting on homogeneous spacelike hypersurfaces, but these 
groups have a simply transitive subgroup G3. In contrast to this, Kantowski- 
Sachs spacetimes admit G4 (acting on homogeneous spacelike hypersurfaces) 
which does not have any simply transitive subgroup G3; it contains a mul- 
tiply transitive G3 acting on 2-dimensional surfaces of constant curvature, 
G 4 = JR X SO{3). A special case of the vacuum Kantowski-Sachs universe 
is represented by the Schwarzschild metric inside the horizon (with t and r 
interchanged). There has been a continuing interest in the Kantowski-Sachs 
models since their discovery in 1966 [302], to which, as the authors acknowl- 
edge, J. Ehlers contributed by his advice. Some of these models had already 
appeared in the PhD thesis of Kip Thorne in 1965 (see also [303] for mag- 
netic Kantowski-Sachs models). Here, however, we just refer the reader to 
[304,305] for their classical description, to [306] for a canonical and quan- 
tum treatment, and to [307] for the latest discussion of the Kantowski-Sachs 
quantum cosmologies. 

Although the 3-dimensional Lie groups which are simply transitive on 
homogenous 3-spaces were classified by Bianchi in 1897, the importance of 
Bianchi ’s work for constructing vacuum cosmological models was only dis- 
covered by Taub in 1951 [197], when the Taub space (cf. Sect. 7) was first 
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given. It is less known that at approximately the same time, if not earlier, 
the first explicit spatially homogeneous expanding and rotating cosmological 
models with matter (of the Bianchi type IX) were constructed by Godel,^^ 
who first presented his results at the International Congress of Mathematics 
held at Cambridge (Mass.) from August 30 till September 5, 1950. 

An exposition of Bianchi models has been given in a number of places: 
in the account on relativistic cosmology by Beckmann and Schiicking [308] 
(complementing the chapter on exact solutions by Ehlers and Kundt [53]), 
in the monographs of Ryan and Shepley [304] , and Zel’dovich and Novikov 
[309], in several comprehensive surveys by MacCallum (see e.g. [305] and [310] 
for his latest review containing a number of references) , most recently, in the 
book on the dynamical system approach in cosmology (in the Bianchi models 
in particular) edited by Wainwright and Ellis [311]; and, first but not least, in 
the classics of Landau and Lifshitz [139]. The Hamiltonian approach initiated 
by Misner [312] in 1968, and used in, amongst other things, the construction 
of various minisuperspace models in quantum gravity, has been reviewed by 
Ryan [266]; for more recent accounts, see several contributions to Misner’s 
Festschrift [313]. An interesting framework which unifies the Hamiltonian 
approach to the solutions which admit homogeneous hypersurfaces either 
spacelike (as Bianchi models) or timelike (as static spherical, or stationary 
cylindrical models) was recently developed by Uggla, Jantzen and Rosquist 
in [314] (with 115 references on many exact solutions). Herewith we shall only 
briefly introduce the Bianchi models, note their special role in understanding 
the character of an initial cosmological singularity, and mention some of the 
most recent developments not covered by the reviews cited above. 

The line element of the Bianchi models can be expressed in the form 

= -dt^ + gab{t) io°- u!^, (97) 

where the time-independent 1-forms (= E^dx°"), a = 1,2,3, are dual to 
time-independent^^ spatial frame vectors (often an arbitrary time- variable 
i is introduced by dt = N{i) di, N being the usual lapse function). Both 

Godel’s profound ideas and results in cosmology, and their influence on later 
developments have been discussed in depth by G. Ellis in his lecture at the Godel 
’96 conference in Brno, Czech Republic, where Godel was born in 1906 (78 years 
before Godel, Ernst Mach was born in a place which today belongs to Brno). 
In the extended written version of Ellis’ talk [301] it is indicated that Godel’s 
work also initiated the investigation of Taub. This may well be true with Godel’s 
paper on the stationary rotating universe, but Taub’s paper on Bianchi models 
was received by the Annals of Mathematics on May 15, 1959, i.e. before Godel’s 
lecture on expanding and rotating models at the Congress of Mathematics took 
place. 

The gravitational degrees of freedom are associated with the component (scalar) 
functions gab(t) - the so called metric approach. Alternatively, in the orthonormal 
frame approach, one chooses gab(t) = Sab and describes the evolution by time- 
dependent forms u)°‘ . In still another approach one employs the automorphism 
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and Eq are group-invariant, commuting with the three Killing fields which 
generate the homogeneity group. They satisfy the relations 

(98) 

[E,,E,] = C=,E^ (99) 

where d is the exterior derivative and are the structure constants of the 
Lie algebra of the homogeneity group. The models are classified according to 
the possible distinct sets of the structure constants. They are first divided 
into two classes: in class A the trace = 0, and in class B, 0. In 

class A one can choose = n^°''>eabc (no summation over a), and classify 
various symmetry types by parameters with values 0, ±1. In class B, in 
addition to one needs the value of a constant scalar h (related to 
to characterize types VI/i and Vll/t (see e.g. [311]). 

The simplest models are the Bianchi I cosmologies in class A with = 
0, i.e. = 0, so that all three Killing vectors (the group generators) com- 
mute. They contain the standard Einstein-de Sitter model with flat spatial 
hypersurfaces (curvature index k = 0). In the vacuum case, all Bianchi I 
models are given by the well-known 1-parameter family of Kasner metrics 
(found in 1921 by E. Kasner and in 1933 by G. Lemaitre without considering 
the Bianchi groups) 

where 

Pi +P2 +P3 = 1, P? +P2 +P3 = 1- (101) 

These metrics were first used to investigate various effects in anisotropic cos- 
mological models. For example, in contrast to standard FRW models with 
“point-like” initial singularities, the Kasner metrics can permit the so called 
“cigar” and “pancake” singularities. To be more specific, consider the congru- 
ence of timelike lines with unit tangent vectors n“ orthogonal to constant time 
hypersurfaces, and define the expansion tensor 0^/3 by Oap = Uap + ^9 hap, 
where hap = gap + naUp is a projection tensor, aap = ~ ^9hap is the 

shear, and 9 = 9^. Determining the three spatial eigenvectors of 9ap with the 
corresponding eigenvalues 9i (i = 1,2, 3), one can define the scale factors k 
by the relation 9i = [dli/dt)/li , and the Hubble scalar H = \{9i + 92 + 6 * 3 ). 
In the FRW models, all 0 at the big bang singularity. In the Kasner 
models at t ^ 0 one finds that either two of the k go to zero, whereas the 
third unboundedly increases (a cigar); or one of the li tends to zero, while 
the other two approach a finite value (pancake). Also there is the “barrel” 
singularity in which the two of the U go to zero, and the third approaches 

of the symmetry group to simplify the spatial metric Qab (see [310,311] for more 
details). 
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a finite value. There is an open question as to whether some other possibili- 
ties exist [311]. Even in the perfect fluid Kasner model, the approach to the 
singularity is “velocity- dominated” - the “vacuum terms” given by the rates 
of change of the scale factors dominate the “matter terms” (curvature terms 
vanish since the Kasner models are spatially fiat). 

The general vacuum Bianchi type II cosmologies (with one = -|-1, 
and the other two vanishing), discovered by Taub in [197], contain two free 
parameters: 

ds^ = -A^dt^ + A~^ t^P^(dx-h4pibz dyf -h A^(t^P^ dy^ + dz^), (102) 

where 

= 1 + , Pi + P2 + P3 = I, pI-^pI-^pI = 1. (103) 

If we put the parameter 6 = 0, the metrics (102) become the Kasner solutions 
(100). Near the big bang the general Bianchi type II solution is asymptotic 
to a Kasner model. In the future it is asymptotic again to a Kasner model, 
but with different values of parameters pi (see e.g. [311]). This fact will be 
important in the following. 

The general Bianchi type V vacuum solutions are also known - these are 
given by the 1-parameter family of Joseph solutions [311]. The type V models 
are the simplest metrics in class B (with all = 0 but = 2aj^i5“j, at = 
constant), and are the simplest Bianchi models which contain the standard 
FRW open universes {k = —1). The Joseph solutions are asymptotic to the 
specific Kasner solution in the past, and tend to the “isotropic Milne model” 
in the future. This is intuitively understandable since open FRW models, 
as they expand indefinitely into the future with matter density decreasing, 
also approach the Milne model. As is well known, the Milne model is just an 
empty fiat (Minkowski) spacetime in coordinates adapted to homogeneous 
spacelike hypersurfaces (the “mass hyperboloids”), with expanding normals 
(see e.g. [28]): 

ds^ = —dr^ [{1 p^)~^dp^ p^{d6^ siii^ 9 dp^)] , ( 104 ) 

with T = f(l — p = u(l — u^), M = r/t < 1, where t,r,9,p are standard 

Minkowski (spherical) coordinates. Because of its significance as an asymp- 
totic solution and its simplicity, the Milne model has been used frequently 
in pedagogical expositions of relativistic cosmology (see e.g. [28,214]) as well 
as in cosmological perturbation theory and quantization (see e.g. [315] and 
references therein). The Milne universe is also an asymptotic state of other 
Bianchi models such as, for example, the intriguing Lukash vacuum type VIR 
solution [316], which can be interpreted as two monochromatic, circularly po- 
larized waves of time-dependent amplitude travelling in opposite directions 
on a FRW background, with fiat or negative curvature spacelike sections. As 
was noticed earlier, some indefinitely expanding Bianchi models approach the 
homogeneous plane wave solutions. Barrow and Sonoda [317] studied the fu- 
ture asymptotic behaviour of the known Bianchi solutions in detail by using 
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nonlinear stability techniques; and in [311] dynamical system methods were 
used. 

From the late 1960s onwards the greatest amount of work was probably 
devoted to the Bianchi type IX vacuum models, baptized the Mixmaster 
universe^^ by Misner [312]. Type IX models are the most general class A 
models with all parameters = +1. They are the only Bianchi universes 
which recollapse. If a perfect fluid is permitted as the matter source, the 
non-vacuum type IX solutions contain the closed FRW models (fc = -1-1) 
with space sections having spherical topology. As a Bianchi I space admits a 
group isomorphic with translations in a 3-dimensional Euclidean space, the 
group of type IX spaces is isomorphic to the group of rotations. None of 
the pairs of three Killing vectors commute. A general Bianchi IX vacuum 
solution is not known, but a particular solution is available: the Taub-NUT 
spacetime, or rather, its spatially homogeneous anisotropic region - the Taub 
universe (see Sect. 7.2). This fact was, for example, employed in an attempt 
to understand the limitations of the minisuperspace methods of quantum 
gravity: by reducing the degrees of freedom to a general Mixmaster universe 
and then further to the Taub universe one can see what such restrictions 
imply [318]. 

The dynamics of general Bianchi cosmologies - and of the Mixmaster 
models in particular ~ close to the big bang singularity has been approached 
with essentially three methods [311]: (i) piecewise approximation methods, 
(ii) Hamiltonian methods, and (iii) dynamical system methods. In the first 
method, used primarily by Russian cosmologists (cf. [299,300]), the evolution 
is considered to be a sequence of periods in which certain terms in the Ein- 
stein equations dominate whereas other terms can be neglected. The Hamil- 
tonian methods appeared first in the “Mixmaster paper” by Misner [312], 
were reviewed by Ryan [266], and more recently by Uggla in [311]. With the 
Hamiltonian (canonical) approaches, minisuperspace methods entered gen- 
eral relativity (cf. Sect. 9.2 on midisuperspace for cylindrical waves). In this 
approach, infinitely many degrees of freedom are reduced to a finite number: 
the state of the universe is described by a “particle” moving inside and re- 
flecting instantaneously from the moving potential walls, which approximate 
the time-dependent potentials in the Hamiltonian. In the third method one 
employs the fact that Einstein’s equations in the case of homogeneous cos- 
mologies can be put into the form of an autonomous system of first-order 
(ordinary) differential equations on a finite dimensional space IR". This is 
of the form dx/dt = f(x), with x G JR" representing a state of the model 
(for example, the suitably normalized components of the shear tr, the Hubble 

The name comes from the fact that, in contrast to a standard FRW model, which 
has a horizon preventing the equalization of possible initial inhomogeneities over 
large scales, the horizon in a type IX nniverse is absent, so that mixing is in prin- 
ciple possible. However, as was shown e.g. in [309], “repeated circumnavigations 
of the universe by light are impossible in the Mixmaster model” . 
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scalar 9, and parameters related to n^°'\ can serve as the “components” of x). 
A study of the orbits x(t) indicates the behaviour of the model. Dynamical 
system methods are the focus of the book [311]. They also are the main tools 
of the monograph [319]. 

In the case of the Bianchi IX models (either vacuum or with perfect fluid), 
all three methods imply (though do not supply a rigorous proof) that an ap- 
proach to the past big bang singularity is composed of an infinite sequence of 
intervals, in each of which the universe behaves approximately as a specific 
Kasner model (100). The transition “regimes” between two different subse- 
quent Kasner epochs, in which the contraction proceeds along subsequently 
different axes, is approximately described by Bianchi type II vacuum solutions 
(102). This famous and enigmatic “oscillatory approach to the singularity” 
(or “Mixmaster behaviour”) has rightly entered the classical literature (cf. 
e.g. [18,19,139]). It indicates that the big bang singularity (and, similarly, a 
singularity formed during a gravitational collapse) can be much more com- 
plicated than the “point-like” singularity in the standard FRW models. This 
oscillatory character has been suggested not only by the qualitative methods 
mentioned above, but also by extensive numerical work (see e.g. [311,320]). 
So far, however, it has resisted a rigorous proof. 

In the “standard” picture of the Mixmaster model it is supposed that the 
evolution of the Bianchi type IX universe near the singularity can be approxi- 
mated by a mapping of the so called Kasner circle onto itself. This is the unit 
circle in the plane, where S± = a±/ H describes the anisotropy in the 

Hubble flow (cf. e.g. Fig. 6.2 in [311]). Each point on the circle corresponds 
to a specific Kasner solution with given fixed values of parameters pi satis- 
fying the conditions (101). There are three exceptional points on the circle 
- those at which one of the pi = -1-1, and the other two vanish. From each 
non-exceptional point P\ on the Kasner circle there leads a 1-dimensional 
unstable orbit given by the vacuum Bianchi II solution (102), which joins 
Pi to another point P 2 on the circle, then P 2 is mapped to P 3 , etc. This 
“Kasner map” in the terminology of [311], called frequently also the BKL 
(Belinsky-Khalatnikov-Lifshitz) map, describes subsequent changes of Kas- 
ner epochs during the oscillatory approach to a singularity. Recent rigorous 
results of Rendall [321] show that for any finite sequence generated by the 
BKL map, there exists a vacuum Bianchi type IX solution which reproduces 
the sequence with any required accuracy.^® 

The vacuum Bianchi IX models have been extensively analyzed in the 
context of deterministic chaos and their stochasticity, attracting the interest 
of leading experts in these fields [320,322]. Above all, it is the numerical work 
which strongly suggests that it is impossible to make long-time predictions of 

A. Rendall (private communication) reports that the main points of the BKL 
picture for homogeneous universes have been rigorously confirmed in a recent 
work of H. Ringstrom (to be published). 
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the evolution of the system from the initial data, which is the most significant 
property of a chaotic system. 

Most recently, interest in the Bianchi cosmologies with (homogeneous) 
magnetic and scalar fields has been revived. Following his previous work with 
Wainwright and Kerr on magnetic Bianchi VIq cosmologies [323], LeBlanc 
[324] has shown that even in Bianchi I cosmologies one finds an oscillatory 
approach towards the initial singularity if a magnetic field in a general di- 
rection is present. (The points on the Kasner circle are now joined by Rosen 
magneto-vacuum solutions.) Hence, Mixmaster-like oscillations occur due to 
the magnetic field degrees of freedom, even in the absence of an anisotropic 
spatial curvature (present in the vacuum type IX models) - the result an- 
ticipated by Jantzen [325] in his detailed work on Hamiltonian methods for 
Bianchi cosmologies with magnetic and scalar fields. Similar conclusions have 
also been arrived at in [326] for magnetic Bianchi II cosmologies. (LeB lane’s 
papers contain some new exact magnetic Bianchi solutions and a number of 
references to previous work.) Interestingly, in contrast to the magnetic field, 
scalar fields in general suppress the Mixmaster oscillations when approaching 
the initial singularity [327,328]. 

The theory of spatially homogeneous, anisotropic models is an elegant, in- 
triguing branch of mathematical physics. It has played an important role in 
general relativity. The classical monograph of Zel’dovich and Novikov [309], 
or the new volume of Wainwright and Ellis [311] analyze in detail the possi- 
ble observational relevance of these models: they point out spacetimes close 
to FRW cosmologies (at least during an epoch of finite duration) which are 
compatible with observational data. For the most recent work on Bianchi 
VH/i cosmologies which are potentially compatible with the highly isotropic 
microwave background radiation, see [329] (and references therein). Never- 
theless, the present status is such that, in contrast to for example the Kerr 
solution, which is becoming an increasingly strong attractor for practical as- 
trophysicists (cf. Sect. 4.3), the anisotropic models have not really entered 
(astro)physical cosmology so far. Peebles, for example, briefly comments in 
[28]: “The homogeneous anisotropic solutions allowed by general relativity 
are a very useful tool for the study of departures from the Robertson-Walker 
line element. As a realistic model for our Universe, however, these solutions 
seem to be of limited interest, for they require very special initial conditions: 
if the physics of the early universe allowed appreciable shear, why would it 
not also allow appreciable inhomogeneities?” 

An immediate reaction, of course, would be to point out that the FRW 
models require still more “special initial conditions” . However, there appears 
to be a deeper reason why the oscillatory approach towards a singularity may 
be of fundamental importance. Belinsky, Khalatnikov and Lifshitz [299,300] 
employed their piecewise approximation method, and concluded 30 years ago 
that a singularity in a general, inhomogeneous cosmological model is spacelike 
and locally oscillatory: i.e. in their scenario, the evolution at different spatial 
point decouples. At each spatial point the universe approaches the singular- 
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ity as a distinct Mixmaster universe. This view, often criticized by purists, 
appears now to be gaining an increasing number of converts, even among the 
most rigorous of relativists. As mentioned above, the homogeneous magnetic 
Bianchi type VIq models, investigated by LeBlanc et ah, show Mixmaster 
behaviour. The Bianchi VIq models have, as do all Bianchi models, three 
Killing vectors, but two of them commute. The models can thus be general- 
ized by relaxing the symmetry connected with the third Killing vector; one 
can so obtain effectively the inhomogeneous (in one dimension) Gowdy-type 
spacetimes. Weaver, Isenberg and Berger [330], following this idea of Ken- 
dall, analyzed these models numerically, and discovered that the Mixmaster 
behaviour is reached at different spatial points. The numerical evidence for 
an oscillatory singularity in a generic vacuum U(l) symmetric cosmologies 
with the spatial topology of a 3-torus has been found still more recently by 
Berger and Moncrief [331]. 

Before turning to the Gowdy models, a last word on the “oscillatory ap- 
proach towards singularity”. I heard E. M. Lifshitz giving a talk on this issue 
a couple of times, with Ya. B. Zel’dovich in the audience. In discussions after 
the talk, Zel’dovich, who appreciated much this work (its detailed description 
is included in [309]), could not resist pointing out that the number of oscil- 
lations and Kasner epochs will be very limited (to only about ten) because 
of quantum effects which arise when some scale of a model is smaller than 
the Planck length Ipi ^ 10“^^cm. This, however, seems to make the scenario 
still more intriguing. If this is confirmed rigorously within classical relativity, 
how will a future quantum gravity modify this picture? 

12.2 Inhomogeneous Cosmologies 

Among all of the known vacuum inhomogeneous models, the Gowdy solutions 

[332] have undoubtedly played the most distinct role. They belong to the 
class of solutions with two commuting spacelike Killing vectors. Within a 
cosmological context, they form a subclass of a wider class of G 2 cosmologies 
~ as are now commonly denoted models which admit an Abelian group G 2 
of isometries with orbits being spacelike 2-surfaces. A 2-surface with a 2- 
parameter isometry group must be a space of constant curvature, and since 
neither a 2-sphere nor a 2-hyperboloid possess 2-parameter subgroups, it 
must be intrinsically flat. If the 2-surface is an Euclidean plane or a cylinder, 
then one speaks about planar or cylindrical universes. Gowdy universes are 
compact - the group orbits are 2-tori T^. 

The metrics with two spacelike Killing vectors are often called the general- 
ized Einstein-Rosen metrics as, for example, by Garmeli, Gharach and Malin 

[333] in their comprehensive survey of inhomogeneous cosmological models 
of this type. In dimensionless coordinates (t, z, x^), the line element can 
be written as {A,B = 1,2) 



ds'^/L'^ = e^{—dt^ + dz'^) + ^ABdx"^dx^ 



(105) 
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where L is a constant length, F and ^ab depend on t and z only, and thus 
the spacelike Killing vectors are = (0, 0, 1, 0), = (0, 0, 0, 1). 

The local behaviour of the solutions of this form is described by the gra- 
dient of the “volume element” of the group orbits W = (| det(7A_B)|)^^^- 
Classical cylindrical Einstein-Rosen waves (cf. Sect. 9) are obtained if W^a is 
globally spacelike. In Gowdy models, W^a varies from one region to another. 

Considering for simplicity the polarized Gowdy models (when the Killing 
vectors are hypersurface orthogonal), the metric (105) can be written in di- 
agonal form (cf. equations (62), (63), and (65), (66) in the analogous cases 
of plane and cylindrical waves) 

ds^/L^ = [e^'^(-df + dz^) + W^dy^] + e^’^dx^, (106) 

in which U(t,z) and ■j(t,z) satisfy wavelike dynamical equations and con- 
straints following from the vacuum Einstein equations; the function W(t, z), 
which determines the volume element of the group orbit, can be cast into 
a standard form which depends on the topology of t = constant spacelike 
hypersurfaces S. 

As mentioned above, in Gowdy models one assumes these hypersurfaces 
to be compact. Gowdy [332] has shown that S can topologically be (i) a 
3-torus = 5^ 0 5”^ 0 and W = t (except for the trivial case when 
spacetime is identified as a Minkowski space), (ii) a 3-handle (or hypertorus, 
or “closed wormhole”) 0 S'^ with W = sin z sin t, or (iii) a 3-sphere S^, 
again with W = sin 2 ; sin f. (For some subtle cases not covered by Gowdy, see 
[334].) As the form of W suggests, in the case of a topology, the universe 
starts with a big bang singularity at t = 0 and then expands indefinitely, 
whereas in the other two cases it starts with a big bang at t = 0, expands 
to some maximal volume, and then recollapses to a “big crunch” singularity 
at t = 7T. One can determine exact solutions for metric functions in all three 
cases in terms of Bessel functions [335]. Hence, for the first time cosmological 
models closed by gravitational waves were constructed. Gharach found Gowdy 
universes with some special electromagnetic fields [336] , and other generalized 
Gowdy models were obtained. We refer to the detailed survey [333] for more 
information, including the work on canonical and quantum treatments of 
these models, done at the beginning of the 1970s by Berger and Misner, and 
for extensive references. 

Let us only add a few remarks on some more recent developments in which 
the Gowdy models have played a role. Gowdy-type models have been used 
to study the propagation and collision of gravitational waves with toroidal 
wavefronts (as mentioned earlier, 2-tori are the group orbits in the Gowdy 

The same is true in the boost-rotation symmetric spacetimes considered in Sect. 
11: the part F > of the spacetimes, where the boost Killing vector is spacelike, 
can be divided into four different regions, in two of which vector is spacelike, 
and in the other two timelike - see [288] for details. 
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cosmologies) in the FRW closed universes with a stiff ffuid [337]. In the stan- 
dard Gowdy spacetimes it is assumed that the “twists” associated with the 
isometry group on vanish. In [338] the generalized Gowdy models without 
this assumption are considered, and their global time existence is proved. 

As both interesting and non-trivial models, the Gowdy spacetimes have 
recently attracted the attention of mathematical and numerical relativists 
with an increasing intensity, as indicated already at the end of the previous 
section. Ghrusciel, Isenberg and Moncrief [339] proved that Gowdy spacetimes 
developed from a dense subset in the initial data set cannot be extended past 
their singularities, i.e. in “most” Gowdy models the strong cosmic censorship 
is satisfied. 

On cosmic censorship and spacetime singularities, especially in the context 
of compact cosmologies, we refer to a review by Moncrief [340] , based on his 
lecture in the GR14 conference in Florence in 1995. The review shows clearly 
how intuition gained from such solutions as the Gowdy models or the Taub- 
NUT spaces, when combined with new mathematical ideas and techniques, 
can produce rigorous results with a generality out of reach until recently. To 
such results belongs also the very recent work of Kichenassamy and Rendall 
[341] on the sufficiently general class of solutions (containing the maximum 
number of arbitrary functions) representing unpolarized Gowdy spacetimes. 
The new mathematical technique, developed by Kichenassamy [342], the so 
called Fuchsian algorithm, enables one to construct singular (and nonsingu- 
lar) solutions of partial differential equations with a large number of arbitrary 
functions, and thus provide a description of singularities. Applying the Fuch- 
sian algorithm to Einstein’s equations for Gowdy spacetimes with topology 
T^, Kichenassamy and Rendall have proved that general solutions behave at 
the (past) singularity in a Kasner-like manner, i.e. they are asymptotically 
velocity dominated with a diverging Kretschmann (curvature) invariant. One 
needs an additional magnetic field not aligned with the two Killing vectors of 
the Gowdy unpolarized spacetimes in order to get a general oscillatory (Mix- 
master) approach to a singularity, as shown by the numerical calculations 
[330] mentioned at the end of the previous section. 

Much of the work on exact inhomogeneous vacuum cosmological mod- 
els has been related to “large perturbations” of Bianchi universes. In [343] 
the authors confined attention to “plane wave” solutions propagating over 
Bianchi backgrounds of types I- VII. They found universes which are highly 
inhomogeneous and “chaotic” at early times, but are transformed into clearly 
“recognizable” gravitational waves at late times. 

Other types of metrics can be considered as exact “gravitational solitons” 
propagating on a cosmological background. These are usually obtained by ap- 
plying the inverse scattering or “soliton” technique of Belinsky and Zakharov 
[344] to particular solutions of Einstein’s equations as “seeds” . For example, 
Garr and Verdaguer [345] found gravisolitons by applying the technique to 
the homogeneous Kasner seed. Similarly to previous work [343], their models 
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are very inhomogeneous at early times, but evolve towards homogeneity in a 
wavelike manner at late times. 

More recently, Belinsky [346] , by applying a two-soliton inverse scattering 
technique to a Bianchi type VIq solution as a seed, constructed an intriguing 
solution which he christened as a “gravitational breather”, in analogy with the 
Gordon breather in the soliton theory of the sine-Gordon equation. Gravisoli- 
tons and antigravisolitons, characterized by an opposite topological charge, 
can be heuristically introduced and shown to have an attractive interaction. 
The breather is a bound state of the gravisoliton and antigravisoliton. Be- 
linsky suggests that a time oscillating breather exists; but a later discussion 
[347] indicates that the oscillations quickly decay. Alekseev, by employing his 
generalization of the inverse scattering method to the Einstein-Maxwell the- 
ory, obtained exact electrovacuum solutions generalizing Belinsky’s breather 
(see his review [348], containing a general introduction on exact solutions). 

Verdaguer [349] prepared a very complete review of solitonic solutions 
admitting two spacelike Killing vector fields, with the main emphasis on cos- 
mological models. Among various aspects of such solutions, he has noted 
the role of the Bel-Robinson superenergy tensor in the interpretation of cos- 
mological metrics. This tensor and its higher-order generalizations has also 
been significantly used in estimates in the proofs of long-time existence theo- 
rems [39,340]. Recently, differential conservation laws for large perturbations 
of gravitational field with respect to a given curved background have been 
fomulated [350], which found an application in solving equations for cosmo- 
logical perturbations corresponding to topological defects [351]. They should 
bring more light also on various solitonic models in cosmology. 

13 Concluding Remarks 

It is hoped that the preceding pages have helped to elucidate at least one 
issue: that in such a complicated nonlinear theory as general relativity, it is 
not possible to ask relevant questions of a general character without finding 
and thoroughly analyzing specific exact solutions of its field equations. The 
role of some of the solutions in our understanding of gravity and the universe 
has been so many-sided that to exhibit this role properly on even more than 
a hundred pages is not really feasible ... 

Although we have concentrated on only (electro)vacuum solutions, there 
remains a number of such solutions that have also played some role in various 
contexts, but, owing to the absence of additional space and time, or the 
presence of the author’s ignorance, have not been discussed. Tomimatsu-Sato 
solutions and their generalizations, static plane and cylindrical metrics, and 
some algebraically special solutions are examples. 

In his review of exact solutions, Ehlers [56] wrote 35 years ago that “it 
seems desirable to construct material sources for vacuum solutions” , and 30 
years later Bonnor [64], in his review, expressed a similar view. In the above 
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we have noted only some of the thin disk sources of static and stationary 
spacetimes in Sect. 6. To find physically reasonable material sources for many 
of the known vacuum solutions remains a difficult open task. In order to make 
solutions of Einstein’s equations with the right-hand side more tractable, 
one is often tempted to sacrifice realism and consider materials, again using 
Bondi’s phraseology, which are not easy to buy in the shops. Nevertheless, 
there are solutions representing spacetimes filled with matter which would 
certainly belong in a more complete discussion of the role of exact solutions. 

For example, one of the simplest, the spherically symmetric Schwarzschild 
interior solution with an incompressible fluid as matter source, modelling 
“a star of uniform density”, gives surprisingly good estimates of an upper 
bound on the masses of neutron stars; on a more general level, it supplies 
an instructive example of relativistic hydrostatics [18] . Many other spherical 
perfect fluid solutions are listed in [61]. The proof of a very plausible fact 
that any equilibrium, isolated stellar model which is nonrotating must be 
spherically symmetric, was finally completed in [352] and [353]. Physically 
more adequate spherically symmetric static solutions with collisionless mat- 
ter described by the Boltzmann (Vlasov) equation have been studied [101] 
(yielding, for example, arbitrarily large central redshifts); and some of their 
aspects have been recently reviewed from a rigorous, mathematical point of 
view [354] . Going over to the description of matter in terms of physical fields, 
we should mention the first spherically symmetric regular solutions of the 
Einstein~Yang“Mills equations ( “non- Abelian solitons” discovered by Bart- 
nik and McKinnon [355] in 1988), and non- Abelian black holes with “hair”, 
which were found soon afterwards. They stimulated a remarkable activity in 
the search for models in which gravity is coupled with Yang-Mills, Higgs, and 
Skyrmion fields. Very recently these solutions have been surveyed in detail 
in the review by Volkov and Gal’tsov [356] . 

The role of the standard FRW cosmological models on the development 
of relativity and cosmology can hardly be overemphasized. As for two more 
recent examples of this influence let us just recall that the existence of cosmo- 
logical horizons in these models was one of the crucial points which inspired 
the birth of inflationary cosmology (see e.g. [28]); and the very smooth char- 
acter of the initial singularity has led Penrose [102] to formulate his Weyl 
curvature hypothesis, related to a still unclear concept of gravitational en- 
tropy. Homogeneous but anisotropic Bianchi models filled with perfect fluid 
are extensively analyzed in [311]. Very recent studies of Bianchi models with 
collisionless matter [357] reveal how the matter content can qualitatively alter 
the character of the model. 

A number of Bianchi models approach self-similar solutions. Perfect fluid 
solutions admitting a homothetic vector, which in this case implies both 
geometrical and physical self-similarity, have been reviewed most recently by 
Garr and Goley [358] . In their review various astrophysical and cosmological 
applications of such solutions are also discussed. Self-similar solutions have 
played a crucial role in the critical phenomena in gravitational collapse. Since 
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their discovery by Choptuik in 1993, they have attracted much effort, which 
has revealed quite unexpected facts. In [358] these phenomena are analyzed 
briefly. For a more comprehensive review, see [359] . 

Self-similar, spherically symmetric solutions have been very relevant in 
constructing examples of the formation of naked singularities in gravitational 
collapse (see [358] for a brief summary and references) . In particular, the in- 
homogeneous, spherically symmetric Lemaitre-Bondi-Tolman universes con- 
taining dust have been employed in this context. Solutions with null dust 
should be mentioned as well, especially the spherically symmetric Vaidya so- 
lutions: imploding spherical null-dust models have been constructed in which 
naked singularities arise at their centre (see [32] for summary and references) . 

The Lemaitre-Bondi-Tolman models are the most frequently analyzed 
inhomogeneous cosmological models which contain the standard FRW dust 
models as special cases (see e.g. [28,32]). In his recent book Krasihski [360] 
has compiled and discussed most if not all of these exact inhomogeneous cos- 
mological solutions found so far which can be viewed as “exact perturbations” 
of the FRW models. 

Many solutions known already still wait for their role to be uncovered. 
The role of many others may forever remain just in their “being” . However, 
even if new solutions of a “Kerr-like significance” will not be obtained in 
the near future, we believe that one should not cease in embarking upon 
journeys for finding them, and perhaps even more importantly, for revealing 
new roles of solutions already known. The roads may not be easy, but with 
todays equipment like Maple or Mathematica, the speed is increasing. Is 
there another so explicit way of how to learn more about the rich possibilities 
embodied in Einstein’s field equations? 

The most remarkable figure of Czech symbolism, Otokar Bfezina (1868- 
1929) has consoling words for those who do not meet the “Kerr- type” metric 
on the road: “Nothing is lost in the world of the spirit; even a stone thrown 
away may find its place in the hands of a builder, and a house in flames may 
save the life of someone who has lost his way...”. 
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Abstract. Various aspects of the Gauchy problem for the Einstein equations are 
surveyed, with the emphasis on local solutions of the evolution equations. Particular 
attention is payed to giving a clear explanation of conceptual issues which arise 
in this context. The question of producing reduced systems of equations which 
are hyperbolic is examined in detail and some new results on that subject are 
presented. Relevant background from the theory of partial differential equations is 
also explained at some length. 



1 Introduction 

One of the most striking differences between the Newtonian theory of gravity 
and its successor, general relativity, is that in the latter the gravitational field 
acquires its own dynamical properties. Its time evolution is complicated even 
in the absence of matter. This contrasts with the fact that in the Newto- 
nian theory the field vanishes when no matter is present. The field equation, 
namely the Poisson equation, together with the boundary condition that the 
field vanishes at infinity, which is an essential part of the theory, combine 
to give this result. The Einstein equations, the field equations of general rel- 
ativity, allow idealized situations which represent gravitational waves in an 
otherwise empty universe, without any material sources. This reflects the 
different mathematical nature of the equations involved in these two cases. 
The Poisson equation is elliptic while the Einstein equations are essentially 
hyperbolic in nature. The meaning of the term ’essential’ in this context is 
not simple and explaining it is a major theme in the following. 

In order to understand the special theoretical difficulties connected with 
the Einstein equations and what mathematical approaches may be appro- 
priate to overcome them, it is useful to compare gravitation with electro- 
magnetism. The motion of charged matter can be described within the full 
Maxwell theory. However, there is also another possibility, which is used when 
relativistic effects are small, for instance in many situations in plasma physics. 
Here dynamical matter is coupled to the electrostatic field generated by this 
matter at any given time. In this quasi-static model the electric field follows 
the sources in a passive way while in the full theory there are propagating 
degrees of freedom. As in the case of gravitation, the elliptic equation in the 
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non-relativistic theory (the Poisson equation again) is replaced by a system 
of hyperbolic equations, the Maxwell equations. 

The fact that the Maxwell equations are so tractable is due to their lin- 
earity, a convenient feature not shared by the Einstein equations. The theory 
of linear partial differential equations in general, and of linear hyperbolic 
equations in particular, are much better developed than the corresponding 
nonlinear theories. As a side remark it may be noted that the combined 
equations describing electromagnetic fields together with their sources are 
nonlinear and that in that context serious theoretical problems, such as that 
of describing radiation damping, do appear. 

Solutions of hyperbolic equations can be uniquely determined by their 
values on a suitable initial hypersurface. The Cauchy problem is the task 
of establishing a one to one correspondence between solutions and initial 
data, and studying further properties of this correspondence. The solution 
determined by a particular initial datum may be global, i.e. defined on the 
whole space where the equations are defined, or local, i.e. only defined on a 
neighbourhood of the initial hypersurface. ’Local’ and ’global’ could be called 
local and global in time since in the case where a preferred time coordinate 
is present that is exactly what they mean. 

From what has been said so far we see that in studying the Einstein equa- 
tions we are faced with a system of nonlinear hyperbolic equations. Among 
nonlinear hyperbolic equations in physics, those which have been studied 
most extensively are the Euler equations, and so we may hope to get some 
insights from that direction. At the same time, it is wise to be careful not to 
treat the analogy too uncritically, since the status of the Euler and Einstein 
equations is very different. The Euler equations are phenomenological in na- 
ture and much is understood about how they arise from models on a more 
fundamental level. The Einstein equations have been thought of as represent- 
ing fundamental physics for most of their history and the recent idea that 
they arise as a formal limiting case in string theory will require, at the very 
least, a lot more work before it can offer a solid alternative to this. To return 
to the Euler equations, one of their well-known features is the formation of 
shocks. While there is no indication of a directly analogous phenomenon for 
the Einstein equations, it does draw attention to a fundamental fact. For lin- 
ear hyperbolic equations it is in general possible to solve the Cauchy problem 
globally, i.e. to show the existence of a global solution corresponding to each 
initial datum. For nonlinear hyperbolic equations this is much more difficult 
and whether it can be done or not must be decided on a case by case basis. 
The formation of shocks in solutions of the Euler equations is an example of 
the difficulties which can occur. A general theory for general equations can 
only be hoped for in the case of the local (in time) Cauchy problem. 

For the Einstein equations we must expect to encounter the problem that 
solutions of the Cauchy problem for nonlinear hyperbolic equations do not 
exist globally. In the case of the Einstein equations this issue is clouded by the 
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fact that the distinction between local and global solutions made above does 
not apply. To define the notions of local and global we used the concept of the 
space where the equations are defined. In other words we used a background 
space. As we will see in more detail later, in the case of the Einstein equations 
there is no background space; the space-time manifold is part of the solution. 
It is better in this case to talk only about local and global properties of 
solutions and not about local and global solutions. We may then loosely use 
the words ’local’ and ’global’ to refer to all aspects of the Cauchy problem 
which refer to local and global properties of solutions, respectively. Solutions 
of the Einstein equations present global features such as the formation of 
black holes which are peculiar to this system and which are made possible 
by the lack of a background space. 

In this article we are not concerned with general systems of nonlinear hy- 
perbolic equations, but with a particular one, which is given to us by general 
relativity. Actually, when the coupling to matter fields is taken into account, 
we do get a variety of hyperbolic systems. Nevertheless, we might hope that 
in at least some situations of interest, such as gravitational collapse, the dy- 
namics of the gravitational field would dominate the qualitative behaviour 
and let the effects of the particular matter model fade into the background. 
In any case, it is useful to retain the distinction between the local and global 
Cauchy problems. The global problem is what we want to solve, but the local 
problem is a natural first step. Our original plan was to cover both topics, 
but along the way we discovered that the first step is already so rich that 
on grounds of time and space we have relegated global questions to passing 
remarks. Along the way we stumbled over a variety of ‘well-known’ things 
which turned out not to be known at all, or even to be false. 

The theory of the Cauchy problem allows us to formulate and establish 
relativistic causality within general relativity. Another basic function of the 
solution of the Cauchy problem is to parametrize solutions of the field equa- 
tions in a useful way. To single out the class of solutions relevant to the 
description of a given physical situation, we can single out an appropriate 
subclass of initial data, which is often simpler to do. This does not mean 
that by identifying this class of solutions we have solved the problem. Rather 
it means that we have found a class of problems which may be of physical 
relevance and which it is therefore desirable to investigate mathematically. In 
the end the central mathematical problem is to discover some of the global 
properties of the solutions being studied. As has already been said, this ar- 
ticle is almost entirely restricted to local questions, so that we will say little 
more about this point. However we will return to it in the last section. 

The point of view of the Cauchy problem can also be used to throw light 
on various other issues. For instance, it provides a framework in which it 
can be shown that certain approximation methods used in general relativity 
really provide approximations to solutions of the Einstein equations. It can 
help to provide an analytical basis for the development of efficient and reliable 
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numerical schemes to solve the Einstein equations. The Cauchy problem can 
furnish examples which throw light on general conjectures about solutions of 
the Einstein equations. It can be used to investigate whether certain prop- 
erties of explicit solutions of the Einstein equations of importance in general 
relativity are stable to perturbations. 

The structure of the article will now be outlined. In the second section a 
number of fundamental concepts are introduced and critically reviewed. In 
particular, the splitting of the field equations into evolution equations and 
constraints is described. The rest of the article concentrates on the evolu- 
tion equations. The question of how to solve the constraints on the initial 
hypersurface is not considered further. The second section presents the basic 
elements which go into solving the local Cauchy problem. It discusses in par- 
ticular the question of gauge freedom and how to show that the constraints 
are satisfied everywhere, given that they are satisfied on the initial hyper- 
surface (propagation of the constraints). These are aspects of the question 
of hyperbolic reduction, i.e. how questions about the Cauchy problem for the 
Einstein equations can be reduced to questions about the Cauchy problem 
for hyperbolic equations. 

The solution of the Cauchy problem relies on the use of techniques from 
the theory of partial differential equations (PDE) . The third section presents 
some of the relevant techniques and attempts to explain some of the im- 
portant concepts which play a role in the theory of the Cauchy problem for 
hyperbolic equations. We have chosen to concentrate on symmetric hyper- 
bolic systems rather than on other kinds of hyperbolic equations such as 
nonlinear wave equations, which could also be used as a basis for studying 
the Cauchy problem for the Einstein equations. It will be seen that symmet- 
ric hyperbolic systems provide a very flexible tool. A comparative discussion 
of various notions of hyperbolicity is also given. 

In the fourth section we return to the question of hyperbolic reduction. 
Different ways of reducing the Einstein equations in 3-1-1 form are presented 
and compared. The ADM equations, which were already introduced in Sect. 
2, are discussed further. A form of these equations which has proved successful 
in numerical calculations is analysed. A number of other illustrative examples 
are treated in detail. The first example is that of the Einstein-Euler system 
for a self-gravitating perfect fluid. The case of dust, which is significantly 
different from that of a fluid with non- vanishing pressure, and which leads 
to serious problems in some approaches, is successfully handled. The second 
example is a variant of the pure Cauchy problem, namely the initial boundary 
value problem. The third example is the Einstein-Dirac system, which gives 
rise to particular problems of its own. 

The fifth section starts with a discussion of the proofs of the basic state- 
ments of local existence, uniqueness and stability for the Einstein equations, 
based on the PDE theory reviewed in the third section and a particular hy- 
perbolic reduction already introduced in Sect. 2. Then the extension of these 
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results to the case where different kinds of matter are present is sketched. 
Existence, uniqueness and stability constitute together the statement that 
the Cauchy problem is well-posed. The significance of this is underlined by 
an example of a situation where the Cauchy problem is ill-posed. Finally it 
is shown how existence and uniqueness imply the inheritance of symmetries 
of the initial data by the corresponding solutions. 

The choice of topics covered in this article is heavily dependent on the 
interests of the authors. In the last section we list a number of the important 
topics which were not discussed. While this list is also influenced by personal 
taste, we hope that it will provide the interested reader with the opportunity 
to form a balanced view of the subject. 

2 Basic Observations and Concepts 

In this section we give an introductory survey of various aspects of the field 
equations. Most of them will be discussed again, in greater detail, in later 
sections. 

Gravitational fields represented by isometric space-times must be consid- 
ered as physically equivalent and therefore the field equations for the metric 
must have the property that they determine isometry classes of solutions, not 
specific coordinate representations. This is achieved by the covariant Einstein 
equations 

It is often said that many of the specific features of the Einstein equations are 
related to this covariance. One may wonder, however, what is so special about 
it, since the wave equation, the Yang-Mills equations, the Euler equations, 
just to name a few examples, are also covariant. The difference lies in the 
fundamental nature of the metric field. While the examples just quoted are 
defined with respect to some background structure, namely a given Lorentz 
space (in the case of the Yang-Mills equations in four dimensions a conformal 
structure suffices), the Einstein equations are designed to determine Lorentz 
manifolds without introducing any extraneous structures. Consequently, the 
solutions of the equations themselves provide the background on which the 
equations are to be solved. 

This makes it evident that Einstein’s equations can at best be quasi-linear. 
That they are in fact quasi-linear can be seen from their explicit expression. 
Writing the equation in the form 

( 2 . 1 ) 

the principal part of the differential operator of second order which acts on 
the metric coefficients is given by the left hand side. In arbitrary coordinates 
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we have 






\p 



d^9pu d^gxp d'^g^p d^gp^ \ 

dx^ dxP dxP dx'^ dx^ dx'^ dx^ dx^ j ^ ^ 



( 2 . 2 ) 

where Q denotes a rational function of the metric coefficients and their first 
order derivatives. We see that the equations are quasi-linear but not better, 
i.e. they are linear in the derivatives of highest order but these derivatives 
come with coefficients which are given by the unknown itself. The Euler 
equations are also quasi-linear in this sense. However, they are studied in 
general on an independent background space-time in terms of which their 
solutions may be analysed. 



2.1 The Principal Symbol 

Consider a system of k partial differential equations of order m for an Re- 
valued unknown u defined on an open subset U of M" . Suppose that in coor- 
dinates /X = 1, . . . , n on [/ it takes the form 

Pu= A‘^D°‘u = f. (2.3) 

|a|<m 

Here the A°‘ denote smooth real k x fc-matrix-valued functions, f{x) is a 
smooth Re-valued function, the a = (oi, . . . , an), with non-negative integers 
aj, are multi-indices, |a| = oi -I- . . . -I- a„, and D°‘ = i9“i . . . 9“,?. Assume, 
first, that the equations are linear, so that = A“(x). 

The question of the formal solvability of the equation leads to the impor- 
tant notion of a characteristic. Suppose that H = {<P = const., yf 0} is a 
hypersurface of U, defined in terms of a smooth function <1>. A Cauchy data 
set on H for equation (2.3) consists of a set of functions uq, wi, . ■ ■ 
on H. Let the coordinates x^ be chosen such that <P = x^ near H. Interpret- 
ing the functions Ui as the derivatives of a solution u to our equation, 
defined in a neighbourhood of H, we ask whether equation (2.3) allows the 
function d'^u to be determined uniquely on H from the Cauchy data. Since 
all functions u with a\ < m — 1, can obviously be derived from the data, 
it follows that we can solve the equation on H for 9^ u if and only if the 
matrix is invertible at all points of H. Using for any covector 

and multi-index a the notation we observe that with our 

assumptions invertibility of the matrix 

on the right hand side is independent of the coordinates chosen to represent 
equation (2.3) and the function used to represent H. This follows from 
the transformation law of the coefficients of the differential equation and of 
covectors under coordinate transformations. 

We are thus led to introduce the following covariant notions. For a cov- 
ector ^ at a given point x € M, we define the principal symbol of (2.3) 
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at (x,^) as the matrix cr{x,^) = J2\a\=m associated linear 

map). The reader should be warned that a definition with replaced by 
where i is the imaginary unit, is often used, since it fits well with the Fourier 
transform. The latter convention will not be used here. A hypersurface H, 
represented by a function <1> as above, is called nowhere characteristic for 
(2.3), if det(cr(a::, c?^)) yf 0 for a; G and we say that iJ is a characteristic 
hypersurface, or simply a characteristic for (2.3), if det{a{x,d<!>{x))) = 0 for 

X G H. 

Given Cauchy data on a hypersurface H which is nowhere characteristic, 
we can determine a formal expansion of a possible solution u in terms of 
x^ on H by taking formal derivatives of (2.3) and solving for u on H, 
i = TO,TO+ 1,.... Conversely, if H is characteristic, Cauchy data cannot 
be prescribed freely on H , because the equation implies relations among the 
Cauchy data on H. We refer to these relations as the inner equations induced 
by the equations on the characteristic. 

If the rank of the matrix cr(a;, d '^(x)) is k— j with some positive integer j, 
there are j such relations. The principal symbol is not sufficient to describe 
the precise nature of these relations. The complete information of (2.3) is 
needed for this. 

If equation (2.3) is quasi-linear, so that the coefficients of the equation do 
not only depend on the points of M but also on the unknown and its deriva- 
tives of order less than m, we have to proceed slightly differently. Suppose 
M is a solution of the quasi-linear equation (2.3). For given covector f at the 
point X G M we define the principal symbol of (2.3) with respect to u as the 
matrix a{x,i) = ■ ■ ■ > '“(*)l|/ 3|=m-i) and use it to 

define, by the condition above, the characteristics of (2.3) with respect to u. 
Thus for quasi-linear equations the characteristics depend on the solution. 

We return to the Einstein equations. Assuming that (M, 5) is a solution of 
(2.1) of dimension n, we find that for given covector f at x G M the principal 
symbol of the operator in (2.2) defines the linear map 

kfii/ t ((J * k'^ fj,i/ 9 ^ ^pp Ca k\i^ 5 

(2.4) 

of the set of symmetric covariant tensors at x into itself. If is in its kernel, 
we have 

0 = Cp e + fcpA ^p - k^p e 6 - k,p fp, (2.5) 

from which we see that tensors of the form kp,y = + Pp, with arbitrary 

covector rj, generate an n-dimensional subspace of the kernel. This subspace 
coincides with the kernel if Cp?^ 0- If Cp^^ = Oj Cp Oj equation (2.5) 
takes the form ^pPu + ^^Pp = 0, with some covector rjp. Since ^p ^ 0 we 
must have = 0 or, equivalently, I gP^ kpx = kp,j fP. The solutions, given 
by kpi, = hpv+a {fp rjp) with a G K, ry* a fixed covector with fP rj* ^ 0, 

and hpy satisfying hp,j = 0, gP'" hp^ = 0, span a space of dimension 
which is strictly larger than n if n > 4. 
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It follows that any hypersurface is characteristic for the Einstein equa- 
tions. In the case of spacelike or timelike hypersurfaces n relations are im- 
plied on Cauchy data sets, in the case of null hypersurfaces there are ^ 
additional relations. The fact that the rank of the principal symbol drops fur- 
ther for covectors ^ which are null has the immediate consequence that the 
inner equations induced on null hypersurfaces are completely different from 
the inner equations induced on nowhere characteristic hypersurfaces. This is 
related to the fact that null hypersurfaces (and appropriate generalizations 
admitting caustics) may represent wave fronts, swept out by high frequency 
perturbations of the field. The propagation of the latter is governed by the 
inner equations induced on these hypersurfaces (cf. [27] and, for a modern 
discussion, also [54], [88] for the mathematical aspects of this statement). 

The fact that on any hypersurface the tensors of the form + 

Vfi ^re in the kernel of the principal symbol map at (x, ^) can be deduced by 
an abstract argument which uses only the covariance and the quasi-linearity 
of the equations (cf. [12], [64]). This emphasizes again the special role of null 
hypersurfaces. Assume that (f>r is the flow of a vector field X. If we denote 
by (j)* the associated pull-back operation, we have 

<('r(Ric[ff]) = Ric[</>*(ff)]- 

Taking derivatives with respect to r we obtain 

Cx Ric[5] = RiCg[£x g], 

where the right hand side denotes the (at g) linearized Ricci operator applied 
to the Lie derivative Cxg^i/ = -I- of the metric. Considering 

this equation as a differential equation for X, the principal part is given by 
the terms of third order appearing on the right hand side. Since the equation 
is an identity, holding for all vector fields, these terms must cancel. It follows 
that the expressions X^ are in the kernel of the principal symbol 

of the linearized and thus also of the non-linear Ricci operator. 

For the further discussion it will be convenient to use coordinates related 
to a given hypersurface. For simplicity and since it is the case of most interest 
to us, we will assume the latter to be spacelike and denote it by S. Let T be 
a non-vanishing time flow vector field transverse to S (but not necesssarily 
timelike) and t = a, function with {t = 0} = S' and < dt,T > = 1. 

We choose local coordinates x“, a = 1,2,3, on S and extend them to a 
neighbourhood of S such that < dx“,T > = 0. In these coordinates T = dt 
and the metric has the ADM representation 

g = —{adtY + hab {fl°' dt + dx°‘) dt + dx^), (2.6) 

where h = h^b dx°‘ dx^ represents the induced Riemannian metric on the slice 
S. To ensure the non-degeneracy of the metric we assume the lapse function 
a to be positive. We write (3 = = (3°' da for the shift vector field, which 
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is tangent to the time slices. We assume further that the metric h induced on 
the time slices is Riemannian and we denote its contravariant form by , 
so that hbc = 5°' c- 

The (future-directed) unit normal to the time slices is given by = 
— /3^), whence = —aS^^. We write ^ ^ n^, for 

the orthogonal projector onto the time slices, for the 4- 

dimensional representation of the interior metric on the time slices, and use 
g to perform index shifts. The second fundamental form (extrinsic curvature) 
of the time slices is given by 



X/ii/ — ^/i 



^ X hfi hi, ^ £,ri gxp — „ Hn h 



n 



For tensor fields intrinsic to the time slices, i.e. having vanishing con- 

tractions with n, the /i-covariant derivative H on a fixed time slice is given 

by 

Dptp...’' = h\h^ . 

Finally, we write n{f) = dp /, note that = n'^ Vj, rip = Dp log(a), 

and use the coefficients Ip'' p = h'x'^ tt h^ ph'' p which represent the co- 

variant derivative D in the sense that the components 7a ^ c are the Christof- 
fel symbols of the metric hat- The connection coefficients of gpi, can then be 
written 



rp'' p = ripn'' rip n(log a) + rip a'' rip - Up n'' rip - rip rf ap 



(2.7) 



+rip rip - P'' -rip- P'' p- rip- P’' p 
a a a 

+n'' Xiip -n-pX'' p-Xp.'^ rip + jp‘' p. 



2.2 The Constraints 



To isolate the geometrically relevant information contained in the Cauchy 
data, we reduce the coordinate freedom tranverse to S by assuming T to be 
a geodesic unit vector field normal to S. The metric then takes the form 

g = —dt'^ + habdx°' dx^, (2.8) 



and the pull-back of the second fundamental to the time slices is given by 



Xab — 



2 hab- 



(2.9) 



Expressions (2.8) and (2.9) suggest that the essential Cauchy data for the 
metric field are given by the induced metric hab and the second fundamental 
form Xab on the spacelike hypersurface S. Since coordinate transformations 
are unrestricted, the n inner relations induced on general Cauchy data on a 
spacelike hypersurface S must be conditions on the essential Cauchy data. 
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If we contract the covector which corresponds in our context to the 
normal n of S, with the principal symbol of the Einstein tensor G^i, = — 

5 R 9tj.i' and evaluated on k, we get 

C (^(o- • k)^^ - i g^P {a ■ k)xp'^ = 0. 

This identity indicates the combinations of the equations which contain only 
derivatives of first order in directions transverse to S. Indeed, if we express 
the equations 

0 = = n.P n'' + Xg^^ - 

Q = Z^ = nP KP{Gpp + Xgpp - 

on S in terms of h and y, write p = nP n'' Tp^, j^, = —nP hP ^ Tpp, and pull- 
back to S, we get the constraint equations on spacelike hyper surf aces, i.e. the 
Hamiltonian constraint 

0 = 2 Z" = r - xab + (Xa “)' - 2 A - 2 K p, (2.10) 

and the momentum constraint 

0 = Z^ = -DbXa‘^ + ^Jb, ( 2 . 11 ) 

where r denotes the Ricci scalar and D the connection of h. 

These equations have the following geometric meaning. On a spacelike 
hypersurface S the curvature tensors RP ,^prj and rP of g and h respectively 

and the second fundamental form Xpi> £^re related, irrespective of any field 
equation, by the Gauss equation 

upr, = hP xR^ h'^ ph'* ri~X^ p Xuu + X^ n Xi'p (2-12) 

and the Codazzi equation 

Dp Xvv - Du Xm = n v- (2.13) 

The constraint equations follow from (2.12) and (2.13) by contractions, the 
use of the field equations, and pull-back to S. Thus the constraints represent 
the covariant condition for the isometric embeddibility of an initial data set 
{S,hab,Xab, P,ja) luto a solution of the Einstein equations. 

We note here that the constraints (2.10) and (2.11), which are analogues 
of the constraints of Maxwell’s equations, have important physical conse- 
quences. One of the most important of these is the positivity of the mass 
which can be associated with an asymptotically flat initial data set (subject 
to reasonable conditions) [82] , [94] . 
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2.3 The Bianchi Identities 

Before analysing the structure of the field equations further, we note some 
important identities. The Riemann tensor Riem[g] of the metric g, given by 

= + (2.14) 

where Fp^ i, denotes the Christoffel symbols of has the covariance prop- 
erty 

Riem[(^*(g)] = (/)*(Riem[g]), 

where 4> denotes a diffeomorphism of M into itself. Two important identities 
are a direct (cf. [64]) consequence of this, the first Bianchi identity 



\up + R^ pXv + R^ upx — 0, 


(2.15) 


and the second Bianchi identity 




V p E? xvp + Vp W xpi, + Vi/ R? xpp = 0. 


(2.16) 


The latter implies the further identities 




Vp RP vXp = Va R^p — Vp Rvx^ 


(2.17) 


< 

1 

tOl 

<1 

II 

o 


(2.18) 



The second Bianchi identity will serve us two quite different purposes. 
Firstly, it will allow us to resolve certain problems arising from the degeneracy 
of the principal symbol considered above (it is the integrability condition 
which allows us to show the propagation of suitably chosen gauge conditions 
and the preservation of the constraints). Secondly, it will provide us with 
alternative representations of the field equations. 

2.4 The Evolution Equations 

In this section we shall discuss a few basic ideas about the evolution problem. 
Our observations about the constraints and the decomposition of 

^ pu = G pu -\- A gpu ^ Fpu 5 

given by 

Zp„ = Z^^-n„Zp -UpZ^ + npU^Z^ , (2.19) 

with Z^^ = hp^hP Z\p^ suggest that the basic information on the evolution 
equations should be contained in 



= 0 , 
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or any combination of it with the constraints. To obtain simple expressions 
in terms of the field and the second fundamental form defined by 
the generalization 

Crhf_t^ = 2axtii^ + ( 2 . 20 ) 

of (2.9), we consider the equation 

0 = ^ V - Z^) (2.21) 

“ (■^T Xpv Xpf Z)fj, Dv a) + + Xp ^ Xpv 2 Xpp Xv ^ 

a 

-Xh^^ - {Tpx ~^T gpx). 

Together with (2.20) it should be regarded as an evolution equation for the 
fields Xpj^' 

It then appears natural to analyse the general solution of the Einstein 
equations by the following procedure: Find initial data, i e. a solution hab, 
Xab of the constraints, on the slice S = {t = 0}. Then find the solution hab, 
Xab of the equations 

dthab = 2axab + Cfjhab, (2.22) 

dt Xab — O ij" ab T Xc Xab 2 Xao Xb ) (2.23) 

X-Da Db a + LfS Xab + a (A hab + K {Tab ~ 2 ^ hab)), 

equivalent to (2.20) and (2.21), which induces these data on S. The first 
step will not be considered further in this article; we shall give some relevant 
references on the problem of solving the constraint equations in Sect. 6. Here 
we want to comment on the second step, which raises several questions. 

i) What determines the functions a, /3“? Is it possible to prescribe them, 
at least locally near S, as arbitrary functions a = a{t,x‘^), /3“ = /3“(t,a:°) of 
the coordinates, possibly with the restriction o? — [3c {3^ > 0, which would 
make dt timelike? We could give a positive answer to this question, if, starting 
from a representation of the metric (2.6) in terms of some coordinate system 
with t' = x^ =0 on S, we could always find a coordinate transformation 
t = t{x^ ), = x°“{x^ ) with t(0,x°' ) = 0, which casts the metric into the 

desired form, i.e. achieves 



1 

a^(t, x^) 



= 5 ™ = 9^ '"' {x^')t,p' ty, 



/3“(t, x^) = = 9>^'Z{x^')t,p, ,, 



(2.24) 

(2.25) 



If the left hand side of the first equation only depended on t and x^ , the 
standard theory of first order PDE’s for a single unknown could be applied 
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to this equation and second equation would essentially reduce to an ODE. 
However, in general this theory does not apply, because the dependence of 
the function a on x°" introduces a coupling to the second equation. 

ii) Suppose we could prescribe lapse and shift arbitrarily. Could we then 
show the existence of a (unique) solution hab, Xab of the initial value prob- 
lem for the equations (2.22) and (2.23) (possibly coupled to some matter 
equations)? 

iii) Suppose we could answer the last question positively, would the re- 
sulting solution to (2.22) and (2.23) then satisfy the constraints (2.10) and 
(2.11) on the slices {t = const.}? Only then would we know that the metric 

obtained from our fields hab, Xab, ct (3^ by (2.6), is a solution of the 
Einstein equations. 

We can answer question (iii) as follows. Suppose equation (2.21) is satisfied 
on a set M =] — a, c[xS', with a, c > 0, and the solution induces the given 
data on {0} x S', which we assume to be identified with S in the obvious way. 
Since (2.21) is equivalent to = 0, we can write on M by (2.19) 

= -rij. Zjf -nf,Z^ + {2n^ n,, + Z^ . 

Taking the divergence, using the contracted Bianchi identity (2.18), assuming 
that the matter field equations have been given such as to ensure T^i, = 0, 
and splitting into normal and tangential parts, we get the equations 

n^^ Z^ - D^Z^ = 2 Z^ V^n'' -2 Z^ n'", 

Z^ -D,Z^ = -Z^ n, - Z^ 

+Z^ n,, V^nP + 2 Z^ n^, 

which imply subsidiary equations, satisfied by Z^ and Z^ , 

where h, denote linear functions of Z^ , Z^ . Since it is a system for 
V = *{Z^ , Z^) of the form 

AP d^v + Bv = Q, 

with symmetric matrices and a positive definite matrix A^ , it is symmetric 
hyperbolic (cf. Sect. 3.1). Moreover, it has characteristic polynomial 

det{AP e^) = - det(/i“") (nP (2.27) 

which implies that its characteristics are hypersurfaces which are timelike or 
null with respect to the metric g^y. 

Consequently, if S' = {(j> = 0} C M, with (j) G C°°{M), is 

a spacelike hypersurface, the matrix A^ (j)^^ is positive definite on S'. Sup- 
pose S' , S" are two spacelike hypersurfaces which intersect at their common 
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2— dimensional boundary Z and bound a compact ‘lens-shaped region’ in M . 
Then it follows from the discussion of symmetric hyperbolic systems in Sect. 
3.1 that the fields must vanish on S" if they vanish on S' . 

To make a precise statement about the consequences of this property, 
we need the important notion of the domain of dependence. Let us assume 
that there is given a time orientation on {M,g). If t/ is a closed subset of 
M we define the future (past) domain of dependence of U in M as the set of 
points X G M such that any 5— non-spacelike curve in M through x which is 
inextendible in the past (future) intersects U. We denote this set by D^(U) 
(resp. D~{U)). 

It can be shown that the result about lens-shaped regions referred to above 
and the fact that the fields Z^ , Z^ vanish on S imply that Z^ and Z^ , 
whence also Z^ , vanish on the domain of dependence D{S) = D'^{S)UD~{S) 
of S in M . This shows the preservation of the constraints under the evolution 
defined by equations (2.22) and (2.23) and the prescribed lapse and shift. 

Questions (i), (ii) are more delicate. Let us assume that the coefficients 
gfj, V ^ analytic functions for in an open subset V of with 

yn{x° = 0} ^ 0, and that a > 0 and /?'” are real analytic functions of t and 
x“. Then equations (2.24) and (2.25) can be written in the form 



i.o' = T’i(t,x“,f,c',a;“ ), x“ ,0' = F 2 {t,x°‘,t^c',x°‘ ), 



with functions F\, F 2 which are real analytic for (t, x", t^,x‘' ,c', ) G x 

V. Thus, by the theorem of Cauchy-Kowalewskaya (cf. [30]), the differential 
problem considered in question (i) can be solved in a neighbourhood of the set 
{x° = 0} C 1^. Using the covariance of equations (2.24), (2.25), it follows that 
given a real analytic Lorentz space (M, g), an analytic spacelike hypersurface 
S va M with coordinates x“ on 5, and analytic functions a = a{t,x") > 0, 

= /3“(t,x'^) there exist unique real analytic coordinates t, x“ on some 
neighbourhood of S' in M such that t = 0 on S and the lapse and shift in the 
expression of g in these coordinates are given by a and jd" . 

Assume now that the 3— manifold S, the initial data hab, Xab solving the 
constraints on S, as well as the functions a{t,x") > 0, /3“(t,x'’’) are real 
analytic and assume for simplicity that = 0. Then we can derive from 
(2.22) and (2.23) a differential system for u = {hab,kabc = hab, c, Xab) of 
the form dfU = Fl(u,t,x°‘) with a function F[ which is real analytic where 
det(hab) yf 0. Again the theorem of Cauchy-Kowalewskaya tells us that this 
system, whence also (2.22) and (2.23), has a unique real analytic solution on 
R X S' near {0} x S for the data which are given on {0} x S after identifying 
the latter in the obvious way with S. By our discussion of (iii) we know that 
we thus obtain a unique analytic solution to the full Einstein equations. 

It should be noted that the solution obtained in this way depends a priori 
not only on the data but also on the chosen lapse and shift. That the latter 
do in fact only affect the coordinate representation of the solution can be seen 
as follows. Given another set of analytic functions a'{t',x “) > 0, /3 “(t',x '’), 
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we can either find, as remarked above, a coordinate transformation t — >■ t' , 
a;“ — >■ a; “ such that the given metric has the new values of lapse and shift in 
the new coordinates , or we can deal with the initial value problem for (2.22) 
and (2.23) with the new lapse and shift. However, due to the uniqueness of 
this solution it must coincide with the first solution in its new coordinate 
representation. 

It is a remarkable fact that in the course of solving the Einstein equations 
we can prescribe rather arbitrarily four functions a, /3“ (in the analytic case) 
which are considered first as functions on some abstract but which, once 
the solution has been constructed, acquire the meaning of lapse and shift for 
the coordinate expression of the metric. Since the coordinates in which they 
have this meaning are defined by a, /3“ implicitly (via the field equations), 
we refer to these functions as the gauge source functions of our procedure 
(we shall see below that, depending on the chosen equations, quite different 
objects can play the role of gauge source functions) and to the act of pre- 
scribing these functions as imposing a gauge condition. The considerations 
above show also that the manifold on which the solution is constructed must 
he regarded as part of the solution. The transition functions relating the dif- 
ferent coordinates we have considered, as well that the domains of definition 
of these coordinate systems themselves, are determined by the gauge source 
functions, the field equations, and the initial data. 

We have seen that in the case where the data and the given lapse and shift 
are real analytic, we can answer our questions in a satisfactory way. However, 
the assumption is not satisfactory. This is not meant to say anything against 
analytic solutions. In fact, most of the ‘exact solutions’ which are the source 
of our intuition for general relativistic phenomena are (piecewise) analytic. 
However, we should not restrict to analyticity in principle. One reason is that 
it would be in conflict with one of the basic tenets of general relativity. Given 
two non-empty open subsets U, V of a, connected space-time (M,g) such 
that no point of U can be connected by a causal curve with a point of V, any 
process in U should be independent of what happens in V. However, if the 
space-time is analytic, the field in V is essentially fixed by the behaviour of g 
in U. For instance, we would not be able to study the evolution of data hab, 
Xab on a 3— manifold S where hab is conformally fiat in some open subset of 
S but not in another one. 

Therefore, Einstein’s equations should allow us to discuss the existence 
and uniqueness of solutions, and also the continuous dependence of the lat- 
ter on the data (stability), in classes of functions which are C°° or of even 
lower smoothness. In other words, Einstein’s equations should imply evo- 
lution equations for which the Cauchy problem is well posed (cf. [45], [49], 
[91]). Whether an initial value problem is well posed cannot be decided on the 
level of analytic solutions and with the methods used to prove the Cauchy- 
Kowalewskaya theorem. On this level there is no basic distinction between 
initial value problems based on spacelike and those based on timelike hy- 
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persurfaces, though in the latter case the stability property is known not to 
be satisfied. Thus we are led to search for evolution equations which satisfy 
some ‘hyperbolicity condition’, i.e. a condition (essentially) on the algebraic 
structure of the equations which entails the well-posedness of the Cauchy 
problem. 

A number of different hyperbolicity conditions are known, all of them 
having in common that they require the equations to admit at each point a 
maximal number of ‘real’ characteristics: if the equations have a local ex- 
pression of the form (2.3), then the operator P is hyperbolic at x G U 
only if there is a covector ( G T* M \ {0} such that every 2— dimensional 
plane in the cotangent space T* M containing C intersects the conormal cone 
{det{J2\a\=m = 0} in fc X m real lines (counting multiplicities) (cf. 

[27] ) . But notice that this condition alone does not ensure the well-posedness 
of the Cauchy problem. Some further remarks about different notions of hy- 
perbolicity can be found in Sect. 3.3. 

To analyse the situation in the case of (2.22) and (2.23), we solve (2.22) 
for Xab and insert into (2.23) to obtain a system of second order for hab which 
takes the form 

^ {df hab - /?" da dt hab ~ dt da hab + P’^ da hab} (2.28) 

-/l'”’* {da dd hab + da db had ~ da da hbd ~ db da had) 

-- dadba- - {ha{a db) dt P‘" - P’^ ha(a db) dd P'") 

a a 

= terms of lower order in hab, ot, P^. 

To analyse the characteristic polynomial, we have to know how a and /3° 
are related to the solution hab- Suppose, for simplicity, that a and /S'” are 
given functions.. Then we have to calculate for given covector ■Cm 
determinant of the linear map 

hab ' t kab — 3^ ^fi kab h kad Ca “f 2 C ^c(a Cb) : 

of symmetric tensors, where we set C“ = h°"^ ^b- Denoting by A'(C) the linear 
transformation which maps the independent components kab, o, < b, onto the 
kab, a < b, we find 

det A'(C) = -(n^CM)®(ff'^"eMe.)'- 

Thus, if we consider lapse and shift as given, the conormal cone of system 
(2.28) satisfies the condition required by hyperbolicity. Moreoever, the char- 
acteristics are timelike or null as one would expect for the evolution equations 
of a theory which is founded on the idea that physical processes propagate 
on or inside the light cone of the metric field. 

However, in spite of the fact that these equations have been used for a 
long time and in various contexts, and in spite of the naturality of equation 
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(2.23) which appears to be indicated by the hyperbolicity of the subsidiary 
equations, it is not known whether the Cauchy problem for equations (2.28) 
is well posed. It appears that we need to make use of the constraint equations 
to obtain suitable evolution equations. 

Therefore we proceed along a different route. While the form of the con- 
straint equations induced on a given hypersurface is unique, there is a huge 
freedom to modify our evolution equations. We can try to bring the principal 
part of (2.28) into a suitable form by using the constraints, by assuming lapse 
and shift to be functionals of the metric, or by subjecting them to some equa- 
tions. Then gauge source functions of quite a different nature may appear. 
Our choice in this section is motivated by the following two observations ([65], 
[16], cf. also [36]): 

(i) Suppose S is some spacelike hypersurface of some Lorentz space (M, g) 

and are coordinates on some open subset U of S. Let {x’^ ) be 

four smooth real functions defined on K.^. Then there exist coordinates x" 
on some neighbourhood of in M with x^ = 0, x^ = x“ on [/ and such 
that the Christoffel coefficients of g in these coordinates satisfy the relations 
F^'(x'^') = F'"'(x^'), where F'"' = g^'p' Fx, p p,. 

Obviously, one can construct coordinates x'^ on some neighbourhood of U 
by solving Cauchy problems for the semi-linear wave equations x'^ = 

—FP {xP ) with Cauchy data on U which are consistent with our require- 
ments. When the wave equations are expressed in these coordinates the re- 
lations above result. 

(ii) The 4— dimensional Ricci tensor can be written in the form 

R/J.V = (2.29) 

+rx\gr,sg^^rp\ + 2Fs\g^Pgx^pF,,)%, 

where we set 



Riy = 9vp. V^F,, = dpFy — Fp^ i, F\. 

Thus, if we consider the Fj^ as given functions, the Einstein equations take 
the form of a system of wave equations for the metric coefficients. 

Before we show that these observations lead to a short and elegant argu- 
ment for the well-posedness of the initial value problem for Einstein’s equa- 
tions (assuming well behaved matter equations), we indicate how the form of 
equations (2.29) relates to our previous considerations. 

From the expressions (2.7) we get the relations 

dta- a^a / 3 “ = (x ~ n,,F''), 

dt /3“ - /3“ .6 (3^ = a" (7“ - log a-h%F'^), 



(2.30) 

(2.31) 
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which indicate that prescribing the functions may fix the evolution of 
lapse and shift. Writing the 3— Ricci tensor similarly to (2.29) 

^ab — 2 ^ ^ab,cd F .^(aTb) F a ^fd ^ be ^ b F ‘2 e h ^ bb) / ; 

with 

ba = Kb b*' = Kb d = K’^ {Kc,d - ^ /Jcd.a) , (2.32) 

we obtain (2.28) in the form 

^ (a^ hab - 2 Kb,tc + Kb,cd /3" /S'") - K‘^ Kb,cd 

D(a [K)c (dt - p'^ dd P‘")] + 2 £>(abb) -2DaDb log a 

= terms of lower order in hab^ a, P^. 

Using (2.31) to replace the terms of second order in the second line, we get 

-5'^" Kb,b.. = -2 D(,{K)c K ^ FK (2.33) 

+ ^ {Da aDba + \ab {dt a - Cf^a) +2 Di^aCt K)c {dtP‘' ~ P‘" ,d P‘^) 

+2 /3° ,(a K)c,t + hc{a P^ ,b) P'^ ,d ~ P'^ ,(a K)d{ 

+ 2 {2 Xac Ab - Xc Xab - be a hfd 7e ^ b 

-2 7c e K^ hd(a lb)"" f + ^Kb + K. {Tab “ ^ Kb) | , 

which can be read as a wave equation for hab- From (2.30) and (2.22) we get 

^ {dt - P^dc)'^ a = {dt - /3^ 9c) X + 2 a x^ - 5 X n^T'^ {2M) 

+3 (n,r")2 - a (9t - /3" 9c) (n,U"). 

Taking the trace of (2.23) and using (2.22) and the Hamiltonian constraint 
(2.10), we get 

{dt - /3^ 9c) X = -D“ a - a jxab X“" - A + «: i (p + Tab) | , (2.35) 

whence, using (2.34), the wave equation 

A {dt - P^ 9c)2 a- DaD- a = -a [xab x“' - A (2.36) 

+/t i (p + Tab) + 5 a X n^F'' - 3 (n^T")^ + (9* - /3^ 9c) (n,.T") | . 
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From (2.32), (2.22), and the momentum constraint (2.11) follows 

{dt - /3" d,) 7“ = /3“ + a + 2 a (2-37) 

+2 D,a - X 77“a - 2 a x"Sc “ d - /?“ .c 7^ 

which implies together with (2.30), (2.31), and (2.22) the wave equation 

^ {dt - r /3“ - /3“ = 2 a K j“ (2.38) 

+4 (x“" - X hn D,a-2a (x"" - X 7fc “ c - ,c 7^ + ,c log a 

-2an^ r’' {-f‘^-D‘^\oga-h‘^^r'')-2axh‘^^r’^ 

+D-{an^ rn - {dt - d,){h<^ , T"). 

Equations (2.33), (2.36), and (2.38) form a hyperbolic system for the fields 
hab, a, /3“ if we consider the functions as given. We note that besides 
(2.23) we used the constraints as well as (2.7) to derive this system. 

We return to the evolution problem. Suppose we are given smooth data 
hab, Xab, P, ja, i- e. a solution of the constraints, on some 3— dimensional 
manifold S, which, for simplicity, we assume to be diffeomorphic to and 
endowed with global coordinates x“. Following our previous considerations, 
we set M = R X S', denote hy t = the natural coordinate on R and extend 
the a;“ in the obvious way to M. We embed our initial data set into M by 
identifying S diffeomorphically with {0} x S (the need for this embedding 
shows that it is in general not useful to restrict the choice of the coordinates 
cc“). 

We now choose four smooth real functions Ffa{x^) on M which will be 
assigned the role of gauge source functions in the following procedure. As sug- 
gested by (2.29) and the preceeding discussion, we study the Cauchy problem 
for the reduced equations 

- ^ + V(^ E,) + A % grts 9^^ Fp ^ , (2.39) 

+2 Fs^ Ti ^ p = ^ (7)u/ ~ 2 '^ 9uv)- 

Since we do not want to get involved in this section with details of the mat- 
ter equations, we assume the Cauchy problem for them to be well posed 
and the energy-momentum tensor to be divergence free as a consequence of 
these equations (one of the remarks following formula (4.70) shows that the 
situation can be occasionally somewhat more subtle). 

To prepare Cauchy data for (2.39), we choose on S a smooth positive 
lapse function a and a smooth shift vector field /3“, which determine with 
the datum hab the ADM representation (2.6) for g^^, on S. Using now hab, 
Xab, ct, /3“ in equations (2.30), (2.31) (with F^ replaced by g''^ Fp), and 
(2.22), we obtain the corresponding datum dtg^u on S. 
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It will be shown in Sect. 5 that the Cauchy problem for (2.39) and the 
initial data above is well posed. Thus there exists a smooth solution to it 
on some neighbourhood M' of S. To assure uniqueness, we assume that M' 
coincides with the domain of dependence of S in M' with respect to 

With the functions calculated from g^^, the reduced equation (2.39) 
takes the form 



G pii/ \ g^iy ^ 2 ^ p 

with = Ffi — Ffj^. To see that our gauge conditions are preserved under 
the evolution defined by (2.39) and our gauge source functions, we show that 

= 0 on M'. 

Using the Bianchi identity (2.18) and our assumptions on the matter equa- 
tions, we get, by taking the divergence of the equation above, the subsidiary 
equation 

D, + R>^,D^ = 0 . 

Since we used (2.30) and (2.31) to determine the initial data, we know that 

= 0 on S'. Moreover, equations (2.36)and (2.38) may be used, on the one 
hand, to calculate dt F^ from but they are, on the other hand, contained 
in (2.39), with T), replaced by This implies that dt = 0 on S. Using the 
uniqueness property for systems of wave equations discussed in Sect. 5.1, we 
conclude that vanishes on M' and solves indeed Einstein’s equations 
(2.1) on M'. 

We refer to the process of reducing the initial value problem for Einstein’s 
equations to an initial value problem of a hyperbolic system as a hyperbolic 
reduction. The argument given here was developed for the first time in [16] 
with the ‘harmonicity assumption‘ = 0. We prefer to keep the complete 
freedom to specify the gauge source functions, because some important and 
complicated problems are related to this. Our derivation of the system (2.33), 
(2.36), (2.38) illustrates the intricate relations between equation (2.23), the 
constraints, and the conservation of the gauge condition in our case. Though 
other such arguments may differ in details, the overall structure of all hyper- 
bolic reduction procedures are similar. 

2.5 Assumptions and Consequences 

If one wants to investigate single solutions or general classes of solutions 
to the Einstein equations by an abstract analysis of the hyperbolic reduced 
equations, there are certain properties which need to be ‘put in by hand’ and 
others which are determined by the field equations. 

The properties in the first class depend largely on the type of physical 
systems which are to be modelled and on the structure of the hypersurface 
on which data are to be prescribed. The latter could be spacelike everywhere 
as in the Cauchy problem considered above. One may wish in this case to 
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consider cosmological models with compact time slices, one of which is the 
initial hypersurface, one may wish to model the field of an isolated system, 
in which case the initial hypersurface should include a domain extending to 
infinity with the field satisfying certain fall-off conditions, or one may wish 
to consider initial hypersurfaces with ‘inner boundaries’ whose nature may 
depend on various geometrical and physical considerations. Other cases of 
interest are the ‘initial-boundary value problem’(cf. Sect. 4.3), where data 
are given on a spacelike and a timelike hypersurface which intersect in some 
spacelike 2-surface, the ‘characteristic initial value problem’ where data are 
given on two null hypersurfaces which intersect in some spacelike two-surface, 
or various other combinations of hypersurfaces. Finally, we have to make 
a choice of matter model which may introduce specific problems which are 
independent of the features of Einstein’s equations considered above. All these 
considerations will affect the problem of finding appropriate solutions to the 
corresponding constraint equations and the nature of this problem depends 
to a large extent on the nature of the bounding hypersurface. We note that 
the fall-off behaviour of the data on spacelike hypersurfaces can only partly 
be specified freely, other parts being determined by the constraint equations. 

There are conditions which we just assume because we cannot do better. 
For instance, one may wish to analyse solutions which violate some basic 
causality conditions (cf. [52]) or which admit various identifications in the 
future and past of the initial hypersurface. Such violations or identifications 
introduce compatibility conditions on the data which cannot be controlled 
locally. At present there are no techniques available to analyse such situations 
in any generality. In the context of the Cauchy problem we shall only consider 
solutions (M,g) of the Einstein equations which are globally hyperbolic and 
such that the initial hypersurface S' is a Cauchy hypersurface of the solution, 
i.e. every inextendible non-spacelike curve in M intersects S exactly once 

[ 52 ]. 

The class of properties which need to be inferred from the structure of 
the data and the field equations includes in general everything which has to 
do with the long time evolution of the field: the development of singularities 
and horizons or of asymptotic regimes where the field becomes in any sense 
weak and possibly approximates the Minkowski field. This does not, of course, 
preclude the possibility of making assumptions on the singular or asymptotic 
behaviour of the field and analysing the consistency of these assumptions 
with the field equations. But in the end we would like to characterize the 
long time behaviour of the field in terms of the initial data. 

3 PDE Techniques 

3.1 Symmetric Hyperbolic Systems 

In order to obtain results on the existence, uniqueness and stability of solu- 
tions of the Einstein equations it is necessary to make contact with the theory 
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of partial differential equations. A good recent textbook on this theory is [32]. 
One part of the theory which can usefully be applied to the Einstein equations 
is the theory of symmetric hyperbolic systems. This will be discussed in some 
detail in the following. The aim is not to give complete proofs of the results of 
interest in general relativity, but to present some arguments which illustrate 
the essential techniques of the subject. It is important to note that the use of 
symmetric hyperbolic systems is not the only way of proving local existence 
and uniqueness theorems for the Einstein equations. The original proof [16] 
used second order hyperbolic equations. Other approaches use other types 
of equations such as mixed hyperbolic-elliptic systems (see, e.g. [25], Chap. 
10). The basic tools used to prove existence are the same in all cases. First 
a family of approximating problems is set up and solved. Of course, in order 
that this be helpful, the approximating problems should be easier to solve 
than the original one. Then energy estimates, whose definition is discussed 
below, are used to show that the solutions of the approximating problems 
converge to a solution of the original problem in a certain limit. 

Now some aspects of the theory of symmetric hyperbolic systems will be 
discussed. We consider a system of equations for k real variables which are 
collected into a vector-valued function u. The solution will be defined on an 
appropriate subset of M x M”. (The case needed for the Einstein equations is 
n = 3.) A point of K x R" will be denoted by (t, x). The equations are of the 
form: 

A°{t, X, u)dtu + A’‘{t,x, u)diu + B{t, x,u) = 0 

This system of equations is called symmetric hyperbolic if the matrices A° 
and A* are symmetric and A° is positive definite. This system is quasi-linear, 
which means that it is linear in its dependence on the first derivatives. The 
notion of symmetric hyperbolicity can be defined more generally, but here 
we restrict to the quasi-linear case without further comment. It is relatively 
easy to prove a uniqueness theorem for solutions of a symmetric hyperbolic 
system and so we will do this before coming to the more complicated existence 
proofs. A hypersurface S is called spacelike with respect to a solution u of 
the equation if for any 1-form conormal to S (i.e. vanishing on vectors 
tangent to S) the expression A‘^(t,x,u)na is positive definite. This definition 
has a priori nothing to do with the usual sense of the word spacelike in general 
relativity. However, as will be seen below, the two concepts are closely related 
in some cases. Define a lens-shaped region to be an open subset G of R x R" 
with compact closure whose boundary is the union of a subset Sq of the 
hypersurface t = 0 with smooth boundary dSo and a spacelike hypersurface 
Si with a boundary which coincides with OSq. It will be shown that if G 
is a lens-shaped region with respect to solutions ui and U 2 of a symmetric 
hyperbolic system and if the two solutions agree on Sq then they agree on all 
of G. This statement is subject to differentiability requirements. 

Consider a symmetric hyperbolic system where A“ and B are G^ functions 
of their arguments. Let ui and U 2 be two solutions of class G^. Let G be a 
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lens-shaped region with respect to u\ and U 2 whose boundary is the union 
of hypersurfaces So and S\ as before. There exist continuous functions M“ 
and N such that: 

- A°‘{t,X,U2) = M°‘{t,X,Ui,U2){ui - U 2 ) 

B{t,X,Ui) - B{t,X,U2) = N{t,X,Ui,U2){ui - U 2 ) 



This sharp form of the mean value theorem is proved for instance in [50]. It 
follows that: 

A^{ui)da{ui - U2) + [M°‘{ui,U2){daU2) + N{ui, U2)]{ui - U2) = 0 

Here the dependence of functions on t and x has not been written out explic- 
itly. This equation for ui — U 2 can be abbreviated to 



A°‘{ui)da{ui - U2) = Q{ui - U2) 
where Q is a continuous function of t and x. It follows that 

da{{e~'"*A°‘{ui){ui - U2),ui - U 2 )) 

= e~’^\[-kA° {ui) + {daA°‘){ui) + 2Q](mi - U 2 ), ui - U 2 ) 

for any constant k. Now apply Stokes’ theorem to this on the region G. This 
gives an equation of the form Ji = Iq -I- Iq where Iq, I\ and Iq are integrals 
over So, and G respectively. Because S\ is spacelike, I\ is non-negative. 
If k is chosen large enough the matrix P = kA^{ui) — {daA'^){ui) — 2 Q is 
uniformly positive definite on G. This means that there exists a constant 
C > 0 such that {v, P{t, x,ui)v) > G{v,v) for all G and (t,x) in G. 
Thus if Ml — U 2 is not identically zero on G, then the volume integral Iq 
can be made negative by an appropriate choice of k. If ui and U 2 have the 
same initial data then Jq = 0. In this case Iq + Ig is negative and we get a 
contradiction. Thus in fact u\ — U 2 = 0. 

This argument shows that locally in a neighbourhood of the initial hy- 
persurface S the solution of a symmetric hyperbolic system at a given point 
is determined by initial data on a compact subset of S. For any point near 
enough to S is contained in a lens-shaped region. In this context we would like 
to introduce the concept of domain of dependence for a symmetric hyperbolic 
system. There are problems with conflicting terminology here, which we will 
attempt to explain now. If a given subset if of S' is chosen, then the domain 
of dependence D{E) of E is the set of all points of R x K" such that the value 
of a solution of the symmetric hyperbolic system at that point is determined 
by the restriction of the initial data to E. Comparing with the definition of 
the domain of dependence for the Einstein equations given in Sect. 2.4 we 
find a situation similar to that seen already for the term ‘spacelike’. The ap- 
plications of the term to the Einstein equations and to symmetric hyperbolic 
systems might seem at first to be completely unrelated, but in fact there is a 
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close relation. This fact was already alluded to in Sect. 2.4. Even for hyper- 
bolic equations there is another ambiguity in terminology. The definition for 
symmetric hyperbolic systems conflicts with terminology used in some places 
in the literature [27]. Often the set which is called domain of dependence 
here is referred to as the domain of determinacy of E while E is said to be a 
domain of dependence for a point p ii p lies in what we here call the domain 
of dependence. In the following we will never use the terminology of [27]. 
However, we felt it better to warn the reader of the problems than to pass 
over them in silence. 

The relationship of the general definition of the domain of dependence 
for symmetric hyperbolic systems adopted in this article with the domain of 
dependence in general relativity will now be discussed in some more detail. 
As will be explained later, the harmonically reduced Einstein equations can 
be transformed to a symmetric hyperbolic system by introducing the first 
derivatives of the metric as new variables. The comparison we wish to make is 
between the domain of dependence D{S) defined by this particular symmetric 
hyperbolic system and the domain of dependence D{S) as defined in Sect. 2.4. 
The first important point is that a hypersurface is spacelike with respect to 
the symmetric hyperbolic system if and only if it is spacelike in the usual 
sense of general relativity, i.e. if the metric induced on it by the space-time 
metric is positive definite. Given this fact, we see that the notion of lens- 
shaped region used in this section coincides in the case of this particular 
symmetric hyperbolic system derived from the Einstein equations with the 
concept introduced in Sect. 2.4. 

The above uniqueness statement will now be compared for illustrative 
purposes with the well-known uniquess properties of solutions of the wave 
equation in Minkowski space. Here we see a simplified form of the situation 
which has just been presented in the case of the Einstein equations. The wave 
equation, being second order, is not symmetric hyperbolic. However, it can 
be reduced to a symmetric hyperbolic system by introducing first derivatives 
of the unknown as new variables in a suitable way. It then turns out that once 
again the concept of a spacelike hypersurface, as introduced for symmetric 
hyperbolic systems, coincides with the usual (metric) notion for hypersurfaces 
in Minkowski space. It is well known that if initial data for the wave equation 
is given on the hypersurface f = 0 in Minkowski space, and if p is a point in 
the region t > 0, then the solution at p is determined uniquely by the data 
within the intersection of the past light cone of p with the hypersurface t = 0. 
Once the wave equation has been reduced to symmetric hyperbolic form this 
statement can be deduced from the above uniqueness theorem for symmetric 
hyperbolic systems. It suffices to note the simple geometric fact that any 
point in the region t > 0 strictly inside the past light cone of p is contained 
in a lens-shaped region with the same property. Thus the solution is uniquely 
determined inside the past light cone of p and hence, by continuity, at p. 
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The existence of the domain of dependence has the following consequence. 
If two sets Ui and of initial data on the same initial hypersurface S coincide 
on an open subset U of S, then the corresponding domains of dependence 
Di{U) and D 2 {U), as well as the corresponding solutions ui and U 2 on them, 
coincide. In other words, on D\{U) the solution ui is independent of the 
extension of outside U. It follows in particular that there is no need to 
impose boundary conditions or fall-off conditions on ui ’on the edge of S” or 
some periodicity condition to determine the solution locally near a point of 
S. This is referred to as the localization property of symmetric hyperbolic 
systems since it means that the theory is not dependent on knowing how 
the initial data behave globally in space. In the following data on R” will 
be considered which are periodic in the spatial coordinates. This is equiv- 
alent to replacing K" as domain of definition of the unknown by the torus 
T” obtained by identifying the Cartesian coordinates in R" modulo 2tt. In 
order to prove statements about solutions of a symmetric hyperbolic system, 
something must be assumed about the regularity of the coefficients. This will 
in particular imply that is continuous. The continuity of and the com- 
pactness of the torus then show together that A^{t, x, u) is uniformly positive 
definite on any finite closed time interval for any given continuous function 
u defined on this interval. 

The general strategy of the existence proof we will present here will now 
be discussed in more detail. This proof is essentially that given in [90]. The 
first detailed proof of an existence theorem for general quasi-linear symmetric 
hyperbolic systems was given by Kato [60] using the theory of semigroups. 
The proof here uses less sophisticated functional analysis, but the basic pat- 
tern of approximations controlled by energy estimates is the same in both 
cases. For simplicity we will restrict to the special case where is the iden- 
tity matrix. Once the proof has been carried out in that case the differences 
which arise in the general case will be pointed out. Assuming now that A^ is 
the identity, the equation can be written in the form: 

dtu = — X, u)diU — B(t, x, u) 

This looks superficially like an ordinary differential equation in an infinite 
dimensional space of functions of x. Unfortunately, this point of view is 
not directly helpful in proving local existence. The essential point is that 
if A'^{t,x,u)di is thought of as an operator on a space of functions of finite 
differentiability, this operator is unbounded. The strategy is now to replace 
the unbounded operator by a bounded one. In fact we even go further and re- 
place the infinite dimensional space by a finite dimensional one. This is done 
by the introduction of a mollifier (smoothing operator) at appropriate places 
in the equation. The mollifier contains a parameter e. The resulting family 
of equations depending on e defines the family of approximating problems 
referred to above in this particular case. The approximating problems can be 
solved using the standard theory of ordinary differential equations. It then 
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remains to show that the solutions to the approximating problems converge 
to a solution of the original problem as e tends to zero. 

A convenient mollifier on the torus can be constructed using the Fourier 
transform. We now recall some facts concerning the Fourier transform on the 
torus. If M is a continuous complex-valued function on T" then its Fourier 
coefficients are defined by: 

m(^) = f dx 

JTn 

Here ^ is an element of i.e. a sequence of n integers. In fact this formula 
makes sense for any square integrable function on the torus. It defines a lin- 
ear mapping from the space L^(T") of complex square integrable functions 
on the torus to the space of all complex- valued functions on Z^. Its image 
is the space of square summable functions L‘^{Z'^) and the norm of u is 
equal to that of ft up to a constant factor. In particular the Fourier transform 
defines an isomorphism of onto Thus to define an operator 

on L^(T") it suffices to define the operator on which corresponds to 

it under the Fourier transform. These statements follow from the elementary 
theory of Hilbert spaces once it is known that trigonometric polynomials (i.e. 
finite linear combinations of functions of the form can be used to 

approximate all continuous functions on the torus in the sense of uniform 
convergence. This is worked out in detail for the case n = 1 in [80], Chap. 
4. The case of general n is not much different, once the approximation prop- 
erty of trigonometric polynomials is known in that context. This follows, for 
instance, from the Stone- Weierstrass theorem ([81], p. 122). 

Let (() be a smooth real- valued function with compact support on H." which 
is identically one in a neighbourhood of the origin and satisfies 0 < < 1 

for all ^ and I^or a positive real number e let <pe{0 = e~"<(’(C/e)- 

If we identify Z" with the set of points in R" with integer coordinates it is 
possible to define functions cf>e on Z" by restriction. The mollifier is defined 
by _ 

JeU = 4>eU 

for any square integrable complex-valued function u on the torus. It is a 
bounded self-adjoint linear operator on L^(T"). In fact jjJejj = 1 for all e. 
The symmetry condition imposed on ensures that if u is real, J^u is also 
real. Note that, for given e, J^u is only non-zero at a finite number of points, 
the number of which depends on e. It follows that J^u is a trigonometric 
polynomial and that the space of trigonometric polynomials which occurs for 
fixed e and all possible functions u is finite dimensional. As a consequence the 
image of Lg(T"), the subspace of consisting of real-valued functions, 

under the operator is a finite dimensional space Vf:. As e tends to zero the 
dimension of tends to infinity and this is why these spaces can in a certain 
sense approximate the infinite-dimensional space Lg(T"). 

The original problem will now be approximated by problems involving or- 
dinary differential equations on the finite-dimensional spaces VI. If the initial 
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datum for the original problem is the initial datum for the approximating 
problem will be u® = . The equation to be solved in the approximating 

problem is: 

dtUe = —Ji[A'’{t,x,u^)J^diU^ + B{t,x,u^)\ = F{u^) (3.1) 

Here the unknown itg takes values in the vector space V^. In order to ap- 
ply the standard existence and uniqueness theory for ordinary differential 
equations to this it suffices to check that the right hand side is a smooth 
function of provided and B are smooth functions of their arguments. 
Let us first check that F is continuous. If is a sequence of elements of 
which converges to u then J^diUn converges to J^diU. The sequence of func- 
tions X, Un)JediUn + B{t, X, converges uniformly to x, u)J^diU+ 

B(t^x,u). To see that F is continous it remains to show that if 1 u uni- 
formly JeVn — >■ JeV- If > u Uniformly then the convergence also holds 
in the norm. This implies that > u in Hence 4>eVn 

pointwise. From this it can be concluded that J^Vn — >■ JeV. This completes 
the proof of the continuity of F. Differentiability can be shown in a simi- 
lar way. The main step is to show that if u and v are elements of then 
lims_>o X, u + sv) —A'^(t, x, u)] exists (in the sense of uniform conver- 

gence) . By smoothness of it does exist and is equal to D‘iA{t, x, u)v, where 
ZI 3 denotes the derivative with respect to the third argument. In this way the 
existence of the first derivative of F can be demonstrated and the explicit 
expression for the derivative shows that it is continuous. Higher derivatives 
can be handled similarly. Hence F is a smooth function from foe to foe. The 
standard theory of ordinary differential equations now gives the existence of 
a unique solution of the approximating problem for each positive value of 
e. Existence is only guaranteed for a short time Te and at this point in the 
argument it is not excluded that could tend to zero as e — ^ 0. It will now 
be seen that this can be ruled out by the use of energy estimates. 

The basic energy estimate involves computing the time derivative of the 
energy functional defined by 

£1 = y \ue\'^dx (3.2) 

Since the functions are smooth and defined on a compact manifold differ- 
entiation under the integral is justified. 

dE/dt = 2 J (u^,dtu^) 

= -2^ {u„ J,[A’-J,d,u, + B]) 

Using the fact that Jg commutes with the operators di and is self-adjoint, 
gives 

J (Ue,Je[A^JediUe\) = J {JeUe, A^ diJ^Ue) 
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Using Stokes theorem and the symmetry of A’’ gives 

J {JeUe, A^diJeUe) = ~ J {OiA^ J^Ue, JeUe) ~ J {A‘ J^d^Ue, JeUe) 

It follows that 

J {Ue,Je[A"JediUe]) = ~\ j {diA" JeUe, JeUe) 

Substituting this in equation (3.2) gives: 

dtS = J {{d,A^)J,u,-2B,J,u,) 

Now 

mAiJeU,-2B\\L2 < \\d,A^\\L^\\u,\\L2 +2 \\B\\l2 
Hence it follows by the Cauchy-Schwarz inequality that 

dt£<mA^\\L’-£ + 2 \\BU.£^/^ 

This is the fundamental energy estimate. Note that this computation is closely 
related to that used to prove uniqueness for solutions of symmetric hyperbolic 
systems above. 

The existence proof also requires higher order energy estimates. To obtain 
these, first differentiate the equation one or more times with respect to the 
spatial coordinates. Higher order energy functionals are defined by 




The square root of the energy functional £g is a norm which defines the 
Sobolev space iJ®. It is because of the energy estimates that Sobolev spaces 
play such an important role in the theory of hyperbolic equations. Differen- 
tiating the equation for (3.1 gives 

dtiD'^Ue) = -MA\t,x,u,)JAD‘^u,] - J,[D°‘{A\t,x,u,)J,diu) 

— A''{t, X, Ue)JediD°'Ue] ~ JeD°"{B(t, X, U^)) (3.3) 

Differentiating the expression for £g with respect to time causes the quantity 
dt{D°‘u^) to appear. This can be substituted for using the equation (3.3). 
Stokes theorem can be used to eliminate the highest order derivatives, as in 
the basic energy estimate. To get an inequality for £g similar to that derived 
above for £ it remains to obtain estimates for the quantities 

D°‘{A^{t,x,Ug)JgdiU^) — A^{t,x,u^)J^diD°‘u^ 



and 



D°‘B{t, X, Ue) 
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This can be done by means of the Moser inequalities, which will be stated 
without proof. (For the proofs see [90].) 

The first Moser estimate says that there exists a constant C > 0 such 
that for all bounded functions /, g on belonging to the Sobolev space 
the following inequality holds: 

\\D-{fg)\\L^ < cmL^D^gU^ + Wf\\LA\g\\L^) 

Here s = jaj and for a given norm ||Tl®g|| is shorthand for the maximum of 
||H“ 5 || over all multiindices a with jaj = s. The second estimate says that 
there exists a constant C > 0 such that for all bounded functions /, g on 
such that the first derivatives of g are bounded, / is in iJ'’ and g is in 
the following inequality holds: 

\\D^{fg) - fD^gU. < C{\\D^ JWMl^ + \\D g^.) 

The third estimate concerns composition with nonlinear functions. Let F be 
a smooth function defined on an open subset of and taking values in MA . 
There exists a constant C > 0 such that for all functions / on taking 
values in a fixed compact subset K of U and belonging to the Sobolev space 
iL® and any multiindex a with s = jo;| > 1 the following inequality holds: 

p“(f(/))iil 2 < cp/iiripviiL= 

The result of using the Moser inequalities in the expression for d£s/dt is 
an estimate of the form: 

dt£s < C£s 

where the constant C depends on a compact set K in which takes values, as 
above, and the L°° norm of the first derivatives of u^- This can be estimated 
by a function of the norm of Uc- By the Sobolev embedding theorem 
(see e.g. [63]), there is a constant C such that IjiteUci < for any 

s > n/2 + 1. Thus we obtain a differential inequality of the form 



dt£s < f{£s) 



for a function /. It follows that the quantity £s{t) satisfies the inequality 

£s(t) < z{t) 

where z(t) is the solution of the differential equation dz/dt = f(z) with initial 
value £’(0) at t = 0. (A discussion of comparison arguments of this type can 
be found in [51], Chap. 3.) Since the function z remains finite on some time 
interval [0,T], the same must be true for £g for any s > n/2 + 1 and the 
solutions Ue exist on the common interval [0,T] for all e. Morover j]ue(f)j]i/s 
is bounded independently of e for all t G [0,T]. Putting this in the equation 
shows that \\dtUe{t)\\H^-^ is also bounded. 
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The proof of existence of a solution with the given initial datum now 
follows by some functional analysis. The main point is that the boundedness 
of a sequence of functions in some norm often implies the existence of a 
sequence which is convergent in another topology. The reader who does not 
possess and does not wish to acquire a knowledge of functional analysis may 
wish to skip the rest of this paragraph. The space defined by the L°° norm 
of ||M(t)||i/s is a Banach space and is also the dual of a separable Banach 
space. The family is bounded in this space and hence, by the Banach- 
Alaoglu theorem (see [81], pp. 68-70) there is a sequence et tending to zero 
such that Ucf. (t) converges uniformly in the weak topology of to some 

function u which is continuous in t with values in H'^(T") with repect to the 
weak topology. The same argument can be applied to the family dtu^. As a 
result, after possibly passing to a subsequence, dtu^^ (t) converges uniformly 
in the weak topology of On the other hand, since T" is compact, 

the inclusion of i7®(T") in is compact. (This is Rellich’s Lemma. 

See e.g. [47], p. 88) Thus, by the vector- valued Ascoli theorem [29] it can 
be assumed, once again passing to a subsequence if necessary, that 
converges uniformly in the norm topology of i7®“^(T"). If s > n/2 -|- 2 then 
the Sobolev embedding theorem can be used to deduce that it converges 
uniformly in the norm of C^(T"). Once this is known it follows directly that 
the expression on the right hand side of the approximate equation converges 
uniformly to that on the right hand side of the exact equation. On the other 
hand dtUk converges to dtu in the sense of distributions and so u satifies the 
original equation in the sense of distributions. The equation then implies that 
the solution is and is a classical solution. It also has the desired initial 
datum. 

Remarks 

1 . The existence and uniqueness theorem whose proof has just been sketched 
says that given an initial datum uq in H“{T^) with s > n/2 + 2 there is 
a solution for this datum which is a bounded measurable function u{t) with 
values in H“{T^) and is such that dtu is a bounded measurable function with 
values in iJ®“^(T"). With some further work it can be shown that n/2 + 2 
can be replaced by n/2 -|- 1 and that the bounded measurable functions are 
in fact continuous with values in the given space [90]. 

2. The time of existence of a solution which is continuous with values in 
iJ®(T”) depends only on a bound for the norm of the initial data in i7®(T"). 

3. The analogues of the higher energy estimates whose derivation was sketched 
here for the solutions of the approximating problems hold for the solution it- 
self [90]. This implies that as long as the norm of u{t) remains bounded 
the H“ norm also remains bounded. This leads to the following continuation 
criterion. If u{t) is a solution on an interval [0,T) and if |]M(t)||ci is bounded 
independently of t by a constant C > 0 then the solution exists on a longer 
time interval. For there is some Ti such that a solution corresponding to any 
data uo with |]Mo||ff® < C exists on [0,Ti). Considering setting data at times 
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just preceding T, and using uniqueness to show that the solutions obtained 
fit together to give a single solution shows the existence of a solution of the 
original problem on the interval [0,T + Ti). 

4. The previous remark implies the existence of C°° solutions corresponding 

to C°° data. For suppose that a datum uq of class C°° is given. Then uq 
belongs to for each s. Thus there is a corresponding solution on an 

interval [0,Tg) which is continuous with values in iJ®. Assume that Tg has 
been chosen as large as possible. It is not possible that Tg — >■ 0 as s — >■ oo. For 
the boundedness of the solution in iJ®, s > n/2 + 1 implies its boundedness 
in and this in turn implies that the solution can be extended to a longer 
time interval. Thus in fact Tg does not depend on s. Using the equation then 
shows that the solution is C°° . 

5. Analogous statements to all of the above can be obtained for the case 
where A° is not the identity with similar techniques. The essential point is to 
modify the expressions for the energy functionals using A°. For instance the 
basic energy for the approximating problems is given hy £ = f (A^Ug,Ug). 

6. Our treatment here differs slightly from that of Taylor [90] by the use of 
a reduction to finite-dimensional Banach spaces. Which variant is used is a 
matter of taste. 

The statement made in the second remark is a part of what is referred to 
as Cauchy stability. This says that when the initial data is varied the corre- 
sponding solution changes in a way which depends continuously on the data. 
This assertion applies to solutions defined on a fixed common time interval 
[0, rj. The above remark shows that a common time interval of this kind can 
be found for data which are close to a given datum in iJ®(T"). A related 
statement is that if the coefficients of the equation and the data depend 
smoothly on a parameter, then for a compact parameter interval the corre- 
sponding solutions for different parameter values exist on a common time 
interval [0, T] and their restrictions to this time interval depend smoothly on 
the parameter. 

3.2 Symmetric Hyperbolic Systems on Manifolds 

In the last section it was shown how existence and uniqueness statements 
can be obtained for solutions of symmetric hyperbolic systems with data 
on a torus. It was also indicated that these imply results for more general 
settings by means of the domain of dependence. In this section some more 
details concerning these points will be provided. First it is necessary to define 
what is meant by a symmetric hyperbolic system on a manifold M . (In the 
context of the last section M = x I.) In general this will be an equation 
for sections of a fibre bundle E over M which, for simplicity, we will take 
to be a vector bundle. The easiest definition is that a symmetric hyperbolic 
system is an equation of the form P{u) = 0, where P is a nonlinear differential 
operator on M which in any local coordinate system satisfies the definition 
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given previously. For the general invariant definition of a differential operator 
on sections of a bundle the reader is referred to [71]. The object defined 
in local coordinates by a{^) = A°‘^a is called the principal symbol of the 
differential operator (cf. Sect. 2.1). Here is supposed to be evaluated on a 
given section u. The principal symbol is defined invariantly by the differential 
operator. Let L{V) be the bundle of linear mappings from V to itself. Then 
the invariantly defined principal symbol is a section of the pullback of the 
bundle L{V) to the cotangent bundle T*M. Given a local trivialization of 
V it can be identified locally with a, k x k matrix-valued function on the 
cotangent bundle. Here k is the dimension of the fibre of V, i.e. the number 
of unknowns in the local coordinate representation. 

In order to define the condition of symmetry in the definition of symmetric 
hyperbolic systems in an abstract context, it is necessary to introduce a 
Riemannian metric on the bundle V. This metric is not visible in the local 
coordinate representation, since in that context this role is played by the fiat 
metric whose components in the given coordinates are given by the Kronecker 
delta. The symmetry condition is expressed in terms of the principal symbol 
and the chosen inner product as: 

(a(()v,w) = (v,a(^)w) 

for all sections v,wofV and all covectors The positivity of is replaced by 
the condition that for some covector ^ the quadratic form associated to a{^) 
via the metric on V (by lowering the index) is positive definite. A hypersurface 
such that all its non-vanishing conormal vectors have this property is called 
spacelike. 

Suppose now we have a symmetric hyperbolic system on a manifold M , 
a submanifold S oi M and initial data such that S is spacelike. The aim is 
to show the existence of a solution on an open neighbourhood U oi S in M 
with the prescribed data and that this is the unique solution on U with this 
property. Choose a covering of S by open sets Ua with the property that for 
each a the closure of this set is contained in a chart domain over which the 
bundle V can be trivialized globally. The problem of solving the equations 
on some open region in M with data the restriction of the original initial 
data to Ua can be solved by means of the results already discussed. For the 
local coordinates can be used to identify Ua with an open subset U'a of a 
torus T" and the initial data can be extended smoothly to the whole torus. 
The equation itself can also be transferred and extended. Corresponding to 
the extended data there exists a solution of the equations on T" x I for 
some interval I. The coordinates can be used to transfer (a restriction of) 
the solution to an open subset Wa of M which is an open neighbourhood 
of Ua- The domain of dependence can then be used to show that there is 
an open neighbourhood where the solutions on the different Wa agree on 
the intersections. More specifically, on an intersection of this kind the two 
in principle different solutions can be expressed in terms of the same local 
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coordinates and trivialization. By construction the coordinate representation 
of the data is the same in both cases. Thus the local uniqueness theorem 
for symmetric hyperbolic systems implies that both solutions are equal in 
a neighbourhood of the initial hypersurface. Thus the solutions on different 
local patches fit together to define the desired solution. Moreover the domain 
of dependence argument shows that it is unique on its domain of definition. 

3.3 Other Notions of Hyperbolicity 

There are two rather different aspects of the concept of hyperbolicity. The 
one is an intuitive idea of what good properties a system of equations should 
have in order to qualify for the name hyperbolic. The archetypal hyperbolic 
equation is the wave equation and so the desired properties are generalizations 
of properties of solutions of the wave equation. The first property is that the 
system should have a well-posed initial value problem. This means that there 
should exist solutions corresponding to appropriate initial data, that these 
should be uniquely determined by the data, and that they should depend 
continuously on the data in a suitable sense. The second property is the 
existence of a finite domain of dependence. This means that the value of 
solutions at a given point close to the initial hypersurface should depend 
only on data on a compact subset of the initial hypersurface. 

The first aspect of hyperbolicity which has just been presented is a list of 
wishes. The second aspect is concerned with the question, how these wishes 
can be fulfilled. The idea is to develop criteria for hyperbolicity. For equations 
satisfying one of these criteria, the desired properties can be proved once and 
for all. Then all the user who wants to check the hyperbolicity of a given 
system has to do is to check the criteria, which are generally more or less 
algebraic in nature. An example is symmetric hyperbolicity, which we have 
already discussed in detail. 

Symmetric hyperbolicity can be applied to very many problems in gen- 
eral relativity, but there are also other notions of hyperbolicity which have 
been applied and cases where no proof using symmetric hyperbolic systems is 
available up to now. We will give an overview of the situation, without going 
into details. The concepts which will be discussed are strict hyperbolicity, 
strong hyperbolicity and Leray hyperbolicity. This multiplicity of definitions 
is a consequence of the fact that there is no one ideal criterion which covers 
all cases. 

There is unfortunately to our knowledge no fully general treatment of 
these matters in the mathematical literature. A general theory of this kind 
cannot be given in the context of this article, and we will refrain from stating 
any general theorems in this section. We will simply indicate some of the 
relations between the different types of hyperbolicity and give some references 
where more details can be found. 

The discussion will use some ideas from the theory of pseudodifferential 
operators. For introductions to this theory see [88], [89], [3]. One important 
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tool is an operator A which defines an isomorphism from functions in the 
Sobolev space to functions in the Sobolev space Powers of this 

operator can be used to associate functions with a given degree of differentia- 
bility (in the sense of Sobolev spaces) with functions with a different degree of 
differentiability. The operator A is non-local. On the torus it could be chosen 
to be the operator (1 — defined in an appropriate way. 

The concept of strict hyperbolicity is easily defined. Suppose that we have 
a system of iV equations of order m. Suppose there is a covector r such that 
for any covector ^ not proportional to r the equation det cr(x, r -I- A^) has 
Nm distinct real solutions A. In this case we say that all characteristics are 
real and distinct and that the system is strictly hyperbolic. The weakness of 
this criterion is due to the fact that it is so often not satisfied for systems of 
physical interest. For example, the system for two functions u and v given by 
the wave equation for each of them is not strictly hyperbolic. The wave equa- 
tion itself is strictly hyperbolic but simply writing it twice side by side leads 
to characteristics which are no longer distinct, so that strict hyperbolicity is 
lost. 

Given a strictly hyperbolic system of order m with unknown u, let Ui = 
A~'^dlu for i = 1, 2, . . . , to — 1. Then the following system of equations is 
obtained: 



dtUi = Aui+i] 0 < i < TO — 1 
dtUm-l = A-'^+^dTu) 

where it is understood that d]P‘u should be expressed in terms of uq, . . . , Wm-i 
by means of the original system. This system is first order in a sense which is 
hopefully obvious intuitively and which can be made precise using the theory 
of pseudodifferential operators. This is not a system of differential equations, 
due to the nonlocality of A, but rather a system of pseudodifferential equa- 
tions. Suppose we write it as dtv + A^(v)diV + B{y) = 0, where v is an 
abbreviation for Uq, . . ■ The idea is now to find a pseudodifferential 

operator of order zero depending on the unknown so that A^{y)A^{y) 

is symmetric for all v and i and A^ has suitable positivity properties. If an A° 
of this kind can be found, then multiplying the equation by it gives something 
which looks like a pseudodifferential analogue of a symmetric hyperbolic sys- 
tem. This symmetrization of the reduced equation can in fact be carried out 
in the case of strictly hyperbolic systems and the resulting first order pseu- 
dodifferential equation be treated by a generalization of the methods used 
for symmetric hyperbolic systems [89], [90]. This gives an existence theorem, 
but does not directly give information about the domain of dependence. This 
information can be obtained afterwards by a different method [88] . 

The algebra involved in symmetrizing the reduction of a strictly hyper- 
bolic system has not been detailed. Instead we will present some general 
information about symmetrization of first order systems using pseudodiffer- 
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ential operators. Consider then the equation 

dtu + A^{u)diU = 0 

If there exists a function A^{u) such that A^ is positive definite and A'^A^ 
is symmetric for each i then multiplying the equation by gives the 

equation: 

A°{u)dtu + A°{u)A^{u)diU = 0 

which is symmetric hyperbolic. This can be generalized as follows. Each dif- 
ferential operator has a symbol which is polynomial in In the theory of 
pseudodifferential operators, operators are associated to symbols which no 
longer have a polynomial dependence on Within this theory the general 
statement (which must of course be limited in order to become strictly true) 
is that any algebra which can be done with symbols can be mimicked by op- 
erators. In terms of symbols the problem of symmetrization which has been 
stated above can be formulated as follows. Given symbols which 

are matrix- valued functions, find a positive definite symmetric real matrix- 
valued function oP{x,u) such that a^{x,u)^iO^{x,u) is symmetric. When the 
problem has been formulated in this way a generalization becomes obvious. 
Why not allow a? to depend on In the context of pseudodifferential op- 
erators this can be done. In order for this to be useful aP must be of order 
zero, which means that it must be bounded in ^ for each fixed (x, u) and 
that its derivatives with respect to ^ and x satisfy similar conditions which 
will not be given here. In fact it is enough to achieve the symmetrization of 
the symbol for ^ of unit length, since the ^ dependence of the procedure is 
essentially homogeneous in ^ anyway. It is also important that it be possible 
to choose aP in such a way that it depends smoothly on since the theory of 
pseudodifferential operators requires sufficiently smooth symbols. To prevent 
confusion in the notation, aP will from now on be denoted by r while we write 
a{x,^,u) = a^{x,u)^,. 

A criterion which ensures that a first order system of PDE can be sym- 
metrized in the generalized sense just discussed is that of strong hyperbolicity 
[63]. It is supposed that all characteristics are real, that they are of constant 
multiplicity, and that the symbol is everywhere diagonalizable. Furthermore, 
it is assumed that this diagonalization can be carried out in a way which 
depends smoothly on x, ^ and u. This means that there is a symbol b{x, u) 
which satifies 

b~^{x,^,u)a{x,^,u)b{x,^,u) = d{x,^,u) 

where d is diagonal. Let r{x,^,u) = [b{x,^,u)b'^{x,^,u)]~^ where the su- 
perscript T denotes the transpose. Then r has the desired properties. For 
r = b~^{b~^Y^ is symmetric and positive definite while ra = db~^ is 

also symmetric. The one part of this criterion which is not purely algebraic 
is the smoothness condition. It is, however, often not hard to check once it is 
known how to verify the other conditions. 
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Another related notion of hyperbolicity is Leray hyperbolicity [66] . It al- 
lows operators with principal parts of different orders in the same system. We 
will discuss this in terms of using powers of the operator A to adjust the or- 
ders of the equations. This is somewhat different from the approach of Leray 
but is closely related to it. Suppose we have a system of PDE with unknown 
u. Let us split the vector u into a sequence of vectors ui, . . . ,rti. Then the 
differential equation which we may write schematically as P{u) = 0 can be 
rewritten as Pj{ui) = 0, where i and j run from 1 to L. Let Vi = mul- 

tiply the equations Pj{ui) = 0 with The result is a system of equations 
of the form Qj{vi) = 0, where Qj{vi) = A*^^^ Pj{A~“^'^'> Vi) . The order of Qj in 
its dependence on Vi is the order of Pj in its dependence on Ui plus t{j) — s{i). 
Now we would like to choose t{j) and s{i) in such a way that all operators Qi 
have the same order (say one) in their dependence on the corresponding vari- 
able Vi and lower order (say zero) in their dependence on Vj for i ^ j- Adding 
the same amount to all indices simultaneously is irrelevant. The indices are 
only determined up to a common additive constant. The system is Leray hy- 
perbolic if this can be achieved by suitable choices of the decomposition and 
the indices, if the operators Pi corresponding to the given decomposition, 
considered with respect to their dependence on Ui, are strictly hyperbolic, 
and if the characteristics of these strictly hyperbolic operators satisfy a cer- 
tain condition. (We say more on this condition later.) It is now plausible 
that this decomposition can be combined with the symmetrization of strictly 
hyperbolic systems to obtain a system of first order pseudodifferential equa- 
tions which admit a pseudodifferential symmetrization. We do however stress 
that as far as we know the only place where the existence proof for Leray 
hyperbolic systems is written down in the literature is in the original lecture 
notes of Leray [66] and that the details of the plausibility argument presented 
here have not been worked out. It has the advantage that it gives an intuitive 
interpretation of the meaning of the indices s(i) and t(j). All the components 
of V in the solution will have the same degree of differentiability. If this is 
then the variables Ui will have differentiability ^ 

We now comment on the condition on the characteristics mentioned in 
the discussion of Leray hyperbolicity. For a strictly hyperbolic system there 
is a notion of spacelike covectors analogous to that for symmetric hyperbolic 
systems. The position of these spacelike covectors is closely related to the 
position of the characteristics. The required condition is that there should 
be vectors which are simultaneously spacelike for the L strictly hyperbolic 
systems occurring in the definition. 

These general ideas concerning Leray hyperbolic systems will now be il- 
lustrated by the example of the Einstein-dust system. This example was first 
treated by Choquet-Bruhat [17]. A discussion of this and other examples can 
also be found in the books of Lichnerowicz [67], [68]. We will have more to 
say about this system in Sects. 4.2 and 5.4. The Einstein-dust equations are 
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V„(pi7“) = 0 
U°‘V^U'^ = 0 
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The problem with these equations from the point of view of symmetric hy- 
perbolic systems is that while the equation for the evolution of p contains 
derivatives of the equation for the evolution of U°‘ does not contain p 
This could be got around if the derivatives of C/“ in the evolution equation 
for p were considered as lower order terms. This is only possible if the first 
derivatives of U°^ are considered on the same footing as p. The adjustment 
of orders involved in the Leray theory, combined with an extra device, allows 
this to be achieved. In order that everything be consistent we expect the order 
of differentiability of the metric to be one greater than that of C/“. For the 
Christoffel symbols occur in the evolution equations for C/“ . Combining these 
two things means that the density should be two times less differentiable than 
the metric. This creates problems with the Einstein equations. For these are 
essentially (and the harmonically reduced equations are precisely) non-linear 
wave equations for the metric. The solution of a system of this kind is only 
one degree more differentiable than the right hand side. This does not fit, 
since the density occurs on the right hand side. The extra device consists 
is differentiating the Einstein equations once more in the direction [/“ and 
then substituting the evolution equation for p into the result. This gives the 
equation: 

= -8npUaUpV-yU^ 

Note that the right hand side of this contains no derivatives of p and so is not 
worse than the right hand side of the undifferentiated equation. On the other 
hand, the left hand side is, in harmonic coordinates, a third order hyperbolic 
equation in and the solution of an equation of this type has (by the Leray 
theory) two more degrees of differentiability than the right hand side. 

After this intuitive discussion of the Einstein-dust system, let us show how 
it is related to the choice of indices needed to make the Leray hyperbolicity of 
the system manifest. The first thing which needs to be done is to decide which 
equations should be considered as the evolution equations for which variables. 
This has already been done in the above discussion. The relative orders of 
differentiability we have discussed suggest that we choose s{g) = 1, s{U) = 2 
and s{p) = 3. This equalizes the expected differentiability of the different 
variables. In the new variables the evolution equations for g, U and p are of 
order four, three and four respectively. To equalize the orders we can choose 
t{g) = 1, t{U) = 2 and t{p) = 1. The blocks into which the system must 
be split are not just three but fifteen, one for each component. The indices 
s{i) and t{j) are here chosen the same for each component of one geometrical 
object. (Here we have ignored the difficulty that because of the normalization 
condition = — 1 the variables C/“ are not all independent.) 
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4 Reductions 

In Sect. 2 we saw an example of a hyperbolic reduction for Einstein’s field 
equations which is, at least in the vacuum case, sufficient to obtain local 
existence results and to demonstrate that the local evolution is dominated 
by the light cone structure and the associated concept of the domain of 
dependence. Thus, besides the existence problem, it settles the conceptual 
question whether the evolution process determined by Einstein’s equations is 
consistent with the basic tenets of the theory. 

Nevertheless, there are various reasons for considering other types of re- 
ductions. Different matter fields may require different treatments, various 
physical or geometrical considerations may require reductions satisfying cer- 
tain side conditions, the desire to control the gauge for an arbitrarily long 
time may motivate the search for new gauge conditions. In particular, in re- 
cent years various systems of hyperbolic equations deduced from Einstein’s 
equations have been put forward with the aim of providing ‘good’ systems for 
numerical calculations (cf. [1], [4], [14], [22], [38], [43], [56]). Since it is difficult 
to judge the relative efficiency of such systems by a few abstract arguments, 
detailed numerical calculations are needed to test them and it still remains 
to be seen which of these systems will serve the intended purpose best. 

A general discussion of the problem of finding reductions which are use- 
ful in numerical or analytical studies should also include systems combining 
hyperbolic with elliptic equations (cf. [25], [77], [24] for analytic treatments 
of such situations) or with systems of still other types. For simplicity we will 
restrict ourselves to purely hyperbolic reductions. But even then a general 
discussion does not yet appear feasible. Ideally, one would like to exhibit a 
kind of ‘hyperbolic skeleton’ of the Einstein equations and a complete charac- 
terization of the freedom to fix the gauge from which all hyperbolic reductions 
should be derivable. Instead, there are at present various different methods 
available which have been invented to serve specific needs. 

Therefore, we will present some of these methods without striving for 
completeness. There are various different boundary value problems of inter- 
est for Einstein’s equations and tomorrow a new question may lead to a new 
solution of the reduction problem. Our aim is rather to illustrate the enor- 
mous richness of possibilities to adapt the equations to various geometrical 
and physical situations, and to comment on some of the new features which 
may be observed. Apart from the illustrative purpose it should not be forgot- 
ten, though, that each of the reductions outlined here implies, if worked out 
in detail, the local existence of solutions satisfying certain side conditions. 

We begin by recalling in general terms the steps to be considered in a 
reduction procedure. It should be noted that these steps are not independent 
of each other, and that the same is true of the following considerations. 

As a first step we need to choose a representation of the field equations. 
Above we used the standard representation in which Einstein’s equations are 
written as a system of second order for the metric coefficients and we 
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also considered the ADM equations with the first and second fundamental 
forms of the time slicing as the basic variables. Later we shall also employ 
representations involving equations of third order in the metric field, in which 
the metric is expressed in terms of an orthonormal frame field. 

Contrary to what has been claimed by too ambitious or suggested by am- 
biguous formulations, there is no way to get hyperbolic evolution equations 
without fixing a gauge. Hyperbolicity implies uniqueness in the PDE sense 
(in contrast to the notion of geometric uniqueness used for the Einstein equa- 
tions, cf. Sect. 5.2). It is necessary to make a choice of precisely four gauge 
source functions. 

Although in the proof of an existence result one has to fix a coordinate 
system, it is of interest to note that there are choices of gauge conditions 
which render the reduced equations hyperbolic for any fixed choice of co- 
ordinates. In Sect. 2 we considered the functions as gauge source 

functions determining the coordinates. Since these functions can be chosen 
arbitrarily and since the principal part of the reduced equations (2.39) will 
not be changed if they depend on the metric coefficients, they can be assumed 
to be of the form F'' = F'^{x^,g\p). 

The following choice is particularly interesting. Let an affine connection 
Fu^ X be given on the manifold M = K. x S' on which we want to construct 
our solution. Since the difference Fy^^x — F^^^x, where the first term denotes 
the connection of the prospective solution g^^, defines a tensor field, the 
requirement that the equation = g'^^ Fv^ x be satisfied, has an invariant 
meaning. This suggests a way to impose a gauge condition which removes the 
freedom to perform diffeomorphisms while leaving all the freedom to perform 
coordinate transformations. In particular, if we assume the connection /h ^ a 
to be the Levi-Civita connection of a metric cjpi, on M, the condition F^ = 
guX equation (2.39) corresponds to the requirement that the identity 

map of M onto itself is a harmonic map under which g is pulled back to g 
(cf. also [37], [38]). 

Again, we do not have a good overview of all the possibilities to impose 
gauge conditions. There exist conditions which work well with quite different 
representations of the field equations. Examples are given by the choice of 
gauge source functions F'^{x^) considered above, which work with suitably 
chosen gauge conditions for the frame field also in the frame formalism ([36]), 
or by the gauge, considered also below, in which the shift j3°‘ and the function 
q = log{ah~'^), with h = det(hab) and cr = const. > 0, are prescribed as 
gauge source functions (cf. [4], [22], [38] for reductions based on cr = | and 
different representations of the Einstein equations). There are other gauge 
conditions which only work for specific representations like some of the ones 
we shall consider in the context of the frame formalism. For some gauge 
conditions, like the ones employing the gauge source functions F'^{x^), the 
universal applicability has been shown (cf. our argument in Sect. 2) or is easy 
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to see. For others, like the ones using the gauge source function q above, it 
has apparently never been shown. 

The gauge problem does not admit a ‘universal solution’ which works in all 
possible situations of interest. In fact, choosing the gauge is related to some of 
the most complicated questions of constructing general solutions to Einstein’s 
equations. Often one would like to find a system of coordinates which covers 
the complete domain of existence of a solution arising from given initial data. 
If there existed such coordinates x^, we could, in principle, characterize them 
in terms of the associated gauge source functions considered in Sect. 2. 

However, the domain of validity, in particular the ‘lifetime’, of a coordinate 
system depends on the data, the equations, the type of gauge condition, 
as well as on the given gauge source functions. In practice, gauge source 
functions which ensure that the coordinates exist globally, if there exist any 
at all, have to be identified in the course of constructing the solution. 

Having implemented the gauge conditions, we have to find a hyperbolic 
system of reduced equations from our representation of the Einstein equations. 
As we shall see, there are often various possibilities and the final choice will 
depend on the desired application. With the reduced equations at hand we 
have to arrange initial data which satisfy the constraints and are consistent 
with the gauge conditions. While the second point usually poses no prob- 
lem, the first point involves solving elliptic equations and requires a seperate 
discussion. 

The reduced equations define together with the initial data a well-posed 
initial value problem and we can show, by using standard techniques of the 
theory of partial differential equations (cf. Sect. 3.1), the existence of solutions 
to the reduced problem, work out differentiability properties of solutions etc. 
or start the numerical evolution. 

As the final step one needs to show that the constraints and gauge con- 
ditions are preserved by the evolution defined by the reduced equations. One 
may wonder whether there exists a universal argument which tells us that this 
will be the case for any system of hyperbolic reduced equations deduced from 
the Einstein equations. The example of the spinor equations discussed below 
shows that this cannot be true without restrictions on the matter fields. How- 
ever, even if we ask this question about reduced problems for the Einstein 
vacuum field equations, the answer seems not to be known. The standard 
method here is to use the reduced equations, as in Sect. 2, together with 
some differential identities, to show that the ‘constraint quantities’ satisfy 
a certain subsidiary system which allows us to argue that these quantities 
vanish. 

If the gauge condition used in our reduced equations has been shown to be 
universally applicable and if there already exists another hyperbolic reduction 
using these gauge conditions and applying to the same geometric and physical 
situation (choice of matter model), it can be argued, invoking the uniqueness 
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property of initial value problems for hyperbolic equations, that our reduced 
equations must preserve the constraints and the gauge conditions. 

In the examples which follow we shall not always produce the complete re- 
duction argument. Often we shall only exhibit a symmetric hyperbolic system 
of reduced equations and remark on some of its properties. 

4.1 Hyperbolic Systems from the ADM Equations 

In the article [11] (cf. also [86]) the authors derived a system from equations 
(2.10), (2.11), (2.22) and (2.23) which seems to be numerically distinctively 
better behaved than any other system derived so far from the ADM equations. 
However, at present there appears to be no clear understanding as to why 
this should be so. The symmetric hyperbolic system to be discussed below 
was found in the course of an attempt to understand whether the system 
considered in [11] is in any sense related to a hyperbolic system. Since other 
hyperbolic systems related to the equations in [11] have been discussed in [2], 
[44] there is now a large family of such systems available. 

In the following we shall consider the fields 

/?“, q = log{ah~n, (4.1) 

with h = det (hab), <J = const. > 0, as the gauge source functions. The 
density h will be used to rescale the 3-metric and the trace-free part of the 
extrinsic curvature to obtain the densities hab = h~ 3 hat, Xab = h~ 3 [xab — 
4 xhab)- For simplicity we shall refer to hab to the conformal metric or, if 
no confusion can arise, simply as to the metric. The following equations are 
derived from the ADM equations by using the standard rules for conformal 
rescalings. The occurrence of some of the terms in the following equations 
find their proper explanation in the general calculus for densities but we shall 
not discuss this here any further. 

The unknowns in our equations will be the fields 

hab, r] = logh, r]a = Dalogh, x=h‘"^Xab, (4.2) 

Ta — Ta ( g 4” Cr) Da ft)g /i, Xab, habc — hab^c, (4-3) 

where % is defined for the metric hab in analogy to (2.32). Note that the power 
of the scaling factor h has been chosen such that we have h = det (hab) = 1, 
whence 0 = h a = hh°'^hab,c = h°-^ habc- We denote by D the covariant 
derivative, by ja^ c the connection coefficients, and by R the Ricci scalar 
which are defined for the conformal metric hab by the standard rules. In 
all expressions involving quantities carrying a tilde any index operations are 
performed with the metric hab- 

Our equations are obtained as follows (cf. also [11]). From (2.22) we get 
by a direct calculation 
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hah hab,c — 2 /lc(a ,5) ^ hab /? ,c “h 2 CT \ab: (4-^) 



5t ?7 - 77,c Z?"" = 2 /3^ ,c + 2 a x- 

Taking derivatives on both sides of the second equation gives 



(4.5) 



dt r]a - r]a,c /3^ = 2 a z)a X + 2 X (cr ??Q + g) + f?c /3° .a + 2 /3^ ,ca- (4.6) 
Equation (2.23) implies together with (2.22) that 

dtx - X,d3" = DcD^ a - a{{\ - a) R + aR + + {h°'’’ Tab - 3p)), 

with an arbitrary real number a. To replace R in the second term on the 
right hand side, we use the Hamiltonian constraint (2.10), to replace R in the 
third term we use the transformation law of the Ricci scalar under conformal 
rescalings and the expression of the Ricci scalar R in terms of the conformal 
quantities, which give 



Thus we get 



R = h 3 log h) - 7c ^ 6 7^+ 

+^“*’ 7a “ d 7c f, - E>a log /r log hi . 



h-3 -{dtX- X,cf3‘") = -aD°-% + {a + a{l^ - a)) D'^ rja (4.7) 
a 2 

+£»“ ^ag + a(7c"d7‘'-^“'’7a"d7c^) 

+ ( ^ ha + ( 1 + 2 cr) 77 “ T>aq + DaQ 

18 6 6 

—h3 {(1 — a) Xab X°^^ + 2 (1 + 2 a) x^ ~ (1 + 2 a) A + — {h“*' Tab + (1 — 2 a) p)}. 

Using the equations above, the definition of 7 a, and the momentum constraint 
(2.11) as well as its expression in terms of the conformal quantities, 

D^'Xca- ^-DaX+ ^Xcah>° log h+ ^ X log h= h'' a, 

we obtain, with arbitrary real number c, by a direct calculation the equation 

- = -cU'^Xca + 2(^ + ^-a)DaX (4.8) 

a 2 6 

+{2 + c) a + 2 Xa^" ^ - 2 a) Pa + Daq} 
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X ~ c ~ 

-x(( 3 +2cr) (arja + Da q) + -rja) - 2 d hab 

+ - hac,b /3" ,d - {Kb lc\ + Kb ld\)~ K’^ ,6 Kc,d 

a I 

“(2 K Vb ,a + K'^ {Kc /3^ ,a + Kb 0^ ,c) ,d ~ (1 + 2 ct) /3^ • 

For the rescaled trace free part of the extrinsic curvature we get from (2.23) 
the equation 

\ah \ah,c ^ 2 %c(a ^ ,h) ^ ^ 

= -a /l“ 3 {Rab “ ^ ^ ^ {^a Db a ~ ^ Dc O'" a hab) 

-a{xKb-2K^ Xac Kd) ■ 

To express the equation in terms of the conformal quantities we use the 
transformation law of the Ricci tensor under conformal rescalings and the 
expression of the conformal Ricci tensor in terms of the connection coefficients 
to get 

Ra6 3 ^ ^ab — 2 ^ ^ab^cd “t“ f^(^a Xb) ^ X 

+7c aKdK^ b + 2K‘" ch‘^^ K{a 7b) ^ Kb (7c dX‘^-K^ K‘" dK‘^ /) 

+ ^ {Da log hDblogh-^ Kb K‘^ Dc log hDd log h). 

Thus we obtain 

- {dt Kb - Kb,c 00 = \ K‘^ Kb,cd - D(^a_%) + ^ Kb D‘^ 7c (4.9) 
a 2 ^ ^ 6 

+Da Dbq - ^ Kb D° D^q + Daq D^q - ^ Kb D'^qDcq 

+ (2cr- ^){r]{aDb)q- ^KbK Dcq) + {K - ^a- ^){VaVb~ ^^ab??c?7'') 
-7c a KdK^ b -2 K" C K{a Xb) f ~ ^ Kb {Xc " e V ~ K‘^ Xc"" f Xe ^ d) 

4”2 Xc{a ^ ,b) T ^ Xab S (X Xab 2h Xac Xbd') • 

In all the equations expressions like DaK etc. are defined by the expressions 
which would hold if K denoted a tensor field. Finally, we get from (4.4) that 

0t ^abc ^abc,d 0 2 O Dq Xab 



(4.10) 



170 



Helmut Friedrich and Alan Rendall 



+2 Xab a{ar]c + DcQ) + 4 a > Xb)d + Kbd ,c 
^ ^ 2 - 

+2 (,a hh)dc + 2 /ld(b (3'^ ^a)c + 2 {habc ,d + hab ,dc) ■ 

If c > 0, the system (4.4), (4.5), (4.6), (4.7), (4.8), (4.9), (4.10) is symmetric 
hyperbolic. This can be seen as follows, (i) We choose real numbers e, / 
satisfying 

e > 0, />0, /a = -2 (^+|-ct), e = | /-(^-ct) (i + ^-cr), (4.11) 

and multiply some of the equations by overall factors c, e, /, a~^, to obtain 
them in the form (writing out only their principal parts here) 



dthab-Kb,cP^ = .-., (4.12) 

dtr] - = ■ ■ ■ , (4.13) 

- {dt Va - Va,c P‘"} = e{2DaX + ■ ■ •}, (4.14) 

a 

-{dtX-X,cPl = + + (4.15) 

a f 

- 7o - 7a.c /3°} = -cD^'xca - faDaX+ (4.16) 

a 



- {dtXab-Xab,cP‘'} = c{l-h‘'^ hab, cd-D(^a%) + ^ hab D° (4.17) 

a Z 6 

habc - habc,dP'^} = c{\baXab + ■■■}■ (4.18) 

4a Z 

(ii) We contract both sides of (4.14) and (4.16) with /i^“, both sides of (4.17) 
with both sides of (4.18) with and add on the right 

hand side of the equation obtained in this way from (4.16) a term of the 
form f Da Xcd, which vanishes identically but whose formal occurence 

makes the symmetry manifest. 

The range in which the coefficients a, c, e, /, cr are consistent with (4.11) 
can be seen by considering the following cases: 

n 1 c 

i) a = 0ocr=- + -. 
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In this case we have e = ( | + |) / and we can choose c, / > 0 arbitrarily. If 
a yf 0 we have / = — yd + f — cr) and e = {a ~ (5 + f ~ which 

gives the following restrictions 



ii) a < 0, 



|a| ^ ^ 1 c 

2(|a| + l) 2^3’ 



in) 0 < a < 1, 



iv) 



1 < a < 



3 -f 2 c 
2c 



1 

2 



1 c 

2 + 3 ^ 

c a 

3 ^ ^ ^ 2 (a- !)■ 



We cannot have e > 0, f > 0 with a > . 

It is of interest to study the characteristics of the system. Equations (4.12) 
and (4.13) contribute factors n(^) = to the characteristic polynomial. 
To find the other characteristics we analyse for which ^ 0 the following 
linear system of equations, defined by the principal symbol map, admits non- 
trivial solutions. We set ^b- 



Va = 2^aX, 

h-^n{0x=-aC% + 2jCVa. 

n{0 la = -C Xba- fa X, 
rr(^) Xab — 2 ^abc C 'C(a lb) T g ^ab ^ Icj 

a{^) hoj^c ~ 2 Xab' 

The condition n{^) = 0 implies Xab = 0 and x = 0 but there remains a 
fifteen-parameter freedom to choose the remaining unknowns. If 

n(a 7 ^ 0, (4.19) 

we derive from the equations above the further equations (writing g{^,C) = 

0xab = n(6 (C(a 7b) - 3 hab r 7c) , 

whence 

{5(?, 0 + ^ ?C % = r 7 , + / a x) e 

2 6 n(^) 

The latter equation implies 

{git 6 + y ^4 r % + ttfax = 0, 



(4.20) 
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and, with such that Ca = 0, 

Finally we get 

{5(? , a + (a - 1) (1 - 2 a) ^4 X - a /i- 5 ^(4 T 7c = 0. (4.21) 

Equations (4.20) and (4.21), are of the form Au = Q with the unknown u the 
transpose of (x,C*7*) the matrix A satisfying 

det A = { 5 (e, 4 + ^ (1 - a) ^c C 4 { 4 ^, 4 + (2 a - 1 ) ^ 4 . 

If 

3(^4) = 0, (4.22) 

it follows from the first of these equations that = 0. If a yf 1, ct yf | 

there remains the two-parameter freedom to choose Xab with Xab = 0. If 
a = 1 or cr = ^ there remains the three-parameter freedom to choose x, Xab 
satisfying Xab = —^^bX- If dUi 4 7 ^ 0 Hi® Xij is known once 7 a has 
been determined. If 

+ = (4.23) 

there is a two-parameter freedom to choose p* 4 S'® above. This is the only 
freedom unless a=|orcr=I-|-|, conditions which exclude each other. 
Further characteristics are given by the equations 



5(?4) + y(l-a) = 


(4.24) 


4C,4 + (2a-i)/i=‘^ec4 = o. 


(4.25) 



The ‘physical’ characteristics correspond of course to (4.22), the two-parameter 
freedom pointed out in the case a yf 1, cr yf I corresponding to the two po- 
larization states of gravitational waves and the additional freedom in the 
other cases corresponding essentially to a gauge freedom. The timelike char- 
acteristics corresponding to (4.19) occur because of the transition from the 
system of second order to a system of first order. These characteristics are 
neither ‘physical’ nor ‘harmful’. Characteristics corresponding to (4.23) are 
spacelike, while the nature of the characteristics corresponding to (4.24) and 
(4.25) depends on the constants a and a. It can happen that one is spacelike 
while the other is timelike or both are spacelike. 

We note here that it is possible to obtain by similar procedures reduced 
equations, for somewhat more complicated unknowns, which have only char- 
acteristics which are timelike or null [44]. 

Though the number c > 0 can be chosen arbitrarily (suitably adjusting 
the others) it is not possible to perform the limit c — >■ 0 while keeping the 



The Cauchy Problem 173 



symmetric hyperbolicity of the system, however, the equations in [11] can 
be considered as limit of equations which are algebraically equivalent to our 
systems. We note also that it is not possible to perform a regular limit cr — 1 0 
which would make lapse and shift the gauge source functions. In the limit 
as c — >■ 0 (which we can perform if we do not insist on hyperbolicity) all 
characteristics, with the exception of the gauge dependent characteristics 
corresponding to (4.25), become non-spacelike. 

We finally remark on our gauge conditions. Under certain assumptions 
the gauge (4.1) coincides with the gauge with harmonic time coordinate and 
prescribed shift. From (2.7) follows the general relation 

dta - a^a x~ o? 

Equation (4.5), written as an equations for h, entails together with (4.1) the 
equation 

dta - /3“ = 2 cr X + a (2 cr /3“ + dtq - q,ad°‘) ■ 

Thus the time coordinate t is harmonic in our gauge, i.e. = 0, if cr = ^ 
and dtq — q,aP°' = — /?“ ,a- In more general situations the expression for 
implied by the equations above contains information about the solution and 
admits no direct conclusion about time-harmonicity in terms of a and /?“. 

It follows from complete reductions based on the gauge source function 
q (cf. e.g [38], [44]) that on solutions of Einstein’s equations it is possible 
to achieve the corresponding gauge close to some initial surface. However, it 
has apparently never been shown that for prescribed cr > 0, /3“ and given 
spacelike hypersurface of some arbitrary Lorentz manifold, coordinates can 
be constructed which realize these gauge source functions. It would be use- 
ful to have a proof of the universal applicability of this type of gauge and 
information about its general behaviour. 

4.2 The Einstein Euler System 

In the following we shall discuss the Einstein-Euler equations, i.e. Einstein’s 
equation coupled to the Euler equation for a simple perfect fluid. Its hyper- 
bolicity has been studied by various authors (cf. [17], [68], [76]). Though the 
system is also important in the cosmological context, our main concern in 
analysing the system here is to control the evolution of compact perfect fluid 
bodies, which are considered as models for ‘gaseous stars’. In this situation 
arises, besides the need to cast the equations into hyperbolic form, the side 
condition to control the evolution of the timelike boundary along which the 
Einstein-Euler equations go over into the Einstein vacuum field equations. 

The analysis of the evolution of the fields in the neighbourhood of this 
boundary poses the basic difficulty in the discussion of compact fluid bodies. 
In the case of spherical symmetry, this problem was overcome in [61]. If 
in more general situations the coordinates used in the hyperbolic reduction 
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are governed e.g. by wave equations, as is the case in the harmonic gauge, 
there appears to be no way to control the motion of this boundary in these 
coordinates. In [39] a system of equations has been derived from the Einstein- 
Euler system which combines hyperbolicity with the Lagrangian description 
of the flow, so that the spatial coordinates are constant along the flow lines. 
Thus the location of the body is known in these coordinates. In the following 
we shall discuss this system and derive the subsidiary system, which was not 
given in [39]. 



The Basic Equations We shall use a frame formalism in which the in- 
formation on the metric is expressed in terms of an orthonormal frame 
{efc}fc=o ....,3 all fields, with the possible exception of the frame itself, 
are given in this frame. To make the formalism easily comparable with the 
spin frame formalism which will be used later, we shall use a signature such 
that gik = g{ei,ek) = diag(l, — 1, — 1, — 1). Let V denote the the Levi-Civita 
connection of The basic unknowns of our representation of the Einstein 
equations are given by 

e^fc, C'‘ jki, matter variables, 

where k = < 6k,x^ > are the coefflcients of the frame held in some coor- 
dinates k are the connection coefficients, defined by Ck = F^^ k 

and satisfying Fi ^ k gji + Fi^ i gjk = 0, and C® jki denotes the conformal Weyl 
tensor in the frame Cfc. The latter is obtained from the decomposition 

Rijkl — Cijkl {^gi[k ^l]j 9j[k (d-26) 



of the curvature tensor 

R jpq = Sp{Fq j) — eq{Fp j) — Fk j {Fp q — Fq p) (4.27) 

\ j~' i k 7~'* T~' k 

I p k ^ q j ^ q k ^ p j 

where we also set S'^ = i?y — | gij R, with R^j and R denoting the Ricci 
tensor and the Ricci scalar. We shall need the notation 



Ti ^ j 6k — — [cj, 6j] + {Fi * j — Fj \) ei, 

jki = R!' jki - O’- jki - g* [k S'/]j + 5j[fc \ 
with R^ jki understood as being given by (4.27). Furthermore we set 

Fjki =ViF^ jki, with E® jki = C* jki — g* [k Si]j. 

In the equations above we take account of the Einstein-Euler equations 
in the form 

Sik = K {Tik - ^ gjk T), 



(4.28) 
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with an energy-momentum tensor of a simple perfect fluid 



Tjfc = {p + p)U^Uk - pg^k- 


(4.29) 


Here p is the total energy density and p the pressure, as measured by an 
observer moving with the fluid, and U denotes the (future directed) flow 
vector field, which satisfies UiW = 1. 

We shall need the decomposition 


Tjk = Qk + qUk, 


(4.30) 


and the held 

Jjk — ^ [j Qk] 1 


(4.31) 


with 

q = U^V,p+(p + p) V, U\ 


(4.32) 


qk = {p + p) Uk + {Uk W V, - Vk}p. 


(4.33) 



We assume that the fluid is simple, i.e. it consists of only one class of par- 
ticles, and denote by n, s, T the number density of particles, the entropy per 
particle, and the absolute temperature as measured by an observer moving 
with the fluid. We shall assume the first law of equilibrium thermodynamics 
which has the familiar form de = —p dv + T ds in terms of the volume v = ^ 
and the energy e = ^ per particle. In terms of the variables above, we have 

dp = ^ dn + nT ds. (4.34) 

n 

We assume an equation of state given in the form 

P = f(n,s), (4.35) 



with some suitable non-negative function / of the number density of particles 
and the entropy per particle. Using this in (4.34), we obtain 



dp 

P = P) 

an 



T = 



1 dp 
n d 



as well as the speed of sound ly, given by 



,9p 



= (iLL) = ^ 

d p ^ p + p dn' 



(4.36) 



(4.37) 



as known functions of n and s. We require that the specific enthalpy and the 
speed of sound are positive, i. e. 



p + p 



> 0 , 



dp d^ p 
dn ^ dn^ 



> 0 . 



n 



(4.38) 
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We assume the law of particle conservation 

f/*V,n + nVil7* = 0. (4.39) 

It implies together with q = 0 and (4.34) that the flow is adiabatic, i.e. the 
entropy per particle is conserved along the flow lines, 

C7* V* s = 0. (4.40) 

The case of an isentropic flow, where the entropy is constant in space and 
time, is of some interest. In this case the equation of state can be given in 
the form 

P = Hp) (4-41) 

with some suitable function h. As a special subcase we shall consider pressure 
free matter (‘dust’ ), where h = 0. 

We note that (4.35), (4.36), (4.37), (4.39), and (4.40) imply 

[/* V,p = -{p + p) V, U\ 



from which we get 

qk = - Vfe p + (p + p) {W V, Uk - 1^'^ Uk V, W}. (4.42) 

Finally, (4.40) implies the equation 

Sfe = 0) (4.43) 

for Sfc = Vfe s, where Cjj denotes the Lie derivative in the direction of U. 

The Einstein-Euler equations are given in our representation by the equa- 
tions z = 0, q = 0, (4.35), (4.36), (4.37), (4.39), and (4.40), where we denote 
by z the vector- valued quantity 

z={T,^j, F,ku qk). (4.44) 

which we shall refer to (as well as to each of its components) as a ‘zero quan- 
tity’. These equations entail furthermore Jjk = 0 and (4.43). After making 
a suitable choice of gauge conditions we shall extract hyperbolic evolution 
equations from this highly overdetermined system . 

Decomposition of Unknowns and Equations From now on we shall 
assume that 

eo = U, 

so that we have C/® = <5® o • For the further discussion of the equations we 
decompose the unknowns and the equations. With the vector held U we 
associate ‘spatial’ tensor fields, i.e. tensor fields satisfying 

T,,....,,,..,.,.,!/®' =0, 1 = 1,---,P. 
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The subspaces orthogonal to U inherit the metric hij = gij — Ui Uj , and hi ^ 
(indices being raised and lowered with gij) is the orthogonal projector onto 
these subspaces. 

We shall have to consider the projections of various tensor fields with 
respect to U and its orthogonal subspaces. For a given tensor any contraction 
with U will be denoted by replacing the corresponding index by U and the 
projection with respect to hi^ will be indicated by a prime, so that for a 
tensor field Tijk we write e.g. 

etc. Denoting by tiju the totally antisymmetric tensor field with £9123 = 1 
and setting Ejki = e'jjjki, we have the decomposition 

£ijki = 2 (J7[j Cj]ki - £ij[k Ui]), 



and the relations 

e'"' ejpg = "2 e [p ,] , e'"' = -2eh\. 

Denoting by \ Cijpq eu the dual of the conformal Weyl tensor, 

its C/-electric and C/-magnetic parts are given by by Eji = C[jjjji, Bji = C^j^ 
respectively. With the notation Ijk = hjk — Uj Uk, we get the decompositions 

Cijki = 2 {lj[k Ei]i - li[k Ei]j) -2{U[k Bi]p f? ij + U[i BjT^p ki), (4.45) 
C* = 2 C/[, E^^p ePki-4. Ep[, e,j ^ - 4 f/[, B ,] - Bpq ,, e« ki- (4.46) 

We set 

a* = C/fcVfet/\ x^J=h^'^VkUJ, x = h^^X^J. 
so that we have 



V, W = U, a} + Xj \ a* = h, * Tq x^J = " /i/ A 

Since U is not required to be hypersurface orthogonal, the field Xij will in 
general not be symmetric. If the tensor field T is spatial, i.e. T = T', we 
define its spatial covariant derivative by = (V T)' . i.e. 

V T ■ = V T h -^h h p 



It follows that 

A hjk = 0, T>i Cjki = 0. 

Equation (4.40) then implies Sk = T’/t s. 

To decompose the equations, we observe the relations 

q = ^ E[jj ^ , qk = -‘^- Eh^ - 
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and set 

Pi = Puiu 

= - K(g P + 2?^) ^3i\ + 2l^{p + p)a^ + 2x'"^ (P i(i 

Qj = ~2 ^3 ^ukl 

= T>^ Bkj + Cj (2 X* fe — Xk *) Eii + k{p + p)xki Cj 
Pij = {P{i\U\j) ~ 3 ^ij PkUl} = Pu Etj + T>k Bif^i Cj) 

—2 Ofe (i — 3 X(i ^ Ej-^i —2x\i Ej)i + hij x^^ E^i + 2 x 

Kj 1 

E^iP + P) iX(ij) - ^Xhij), 

Qki = 2 £(fc El)ij = Eu Bki - T>i Ej(^k Cl) 

“t”2 0-2 e (fc Bi'^j X {k Bi'ji 2 X(fc Bi^^ -\- x B^i Xij Bpq (P (^k i)- 

Then we find the splitting 

Fjki = 2Uj P[k Ui] + hj[k Pi] + Qi (Uj P ki — P j[k Ui]) (4.47) 
2 Pj]k B[] Qji^ kl 2 ^ ^j[k 3 ^ ^ ^j[k Pi] ■ 

The Reduced System i) In the case of pressure free matter the equation 
Qk = 0 tells us that Cq is geodesic and we can assume the frame field to 
be parallelly transported in the direction of eg. Furthermore, we can assume 
the coordinates a;“, a = 1,2,3 to be constant on the flow lines of eg and 
the coordinate t = a;° to be a parameter on the integral curves of eg. These 
conditions are equivalent to 

Fg^fe = 0, e% = [/'^ = 5%, 

which, in turn, imply together with the requirement p = 0 the relation qk = 0 
if p + p > 0. The unknowns to be determined are given by 

Ik — (e^ fj, Pi) j) Eij ) B)j ) p), 

where a,b = 1,2,3. The reduced system is given by 

To^ a = 0, Zi}gQ = 0, Pij = 0, Qki = 0, q = 0. 



(4.48) 
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ii) In the general case we cannot assume the frame to be parallelly trans- 
ported because the evolution of U is governed by the Euler equations. We 
assume the vector fields 6a, a = 1,2,3, to be Fermi transported in the direc- 
tion of U and the coordinates to be chosen as before such that 

ro% = 0, 0 = = 5 ^^ 0 - 

Our unknowns are given by 

^ Oq ^klt Pi 

and the reduced system by 



To^', = 0, Z\%oa = 0, (4.49) 

Oac J'ua = aOb H J'ab) = Oj 

P + P P+P 

P^J = 0, Qki = 0i g = 0, (4.50) 

Cun = -nx, Cus = 0, Cu Sk = 0, 

where it is assumed that the functions p, a, /3 are determined from the 
equation of state (4.35) according to (4.36), (4.37), etc. and p + p> Q. 

Since the functions p, s, n are then determined as functions of the coordi- 
nates x^, we remark that the relation p{x>^) = f{n{x^),s{x^)) is satisfied as 
a consequence of the equations for p, s, and n, and the relation (4.36), since 
q = 0 implies dt P = ^ f{'<^i s). Furthermore, we note that in our formalism 
P'U — dt (-^0 ^ k Pk ^ o) k etc. 

When we solve the equations Pij = 0 and Qu = 0, the symmetry of 
the fields Eij, B^i has to be made explicit. The trace-free condition then 
follows as a consequence of the equations and the fact that the initial data 
are trace-free. With this understanding it is easy to see that the system (4.48) 
is symmetric hyperbolic, the remaining equations only containing derivatives 
in the direction of U. To see that the system consisting of (4.49) and (4.50) 
is also symmetric hyperbolic, we write out some of the equations explicitly 
(taking the opportunity to correct some misprints in [39]). It follows directly 
from the definition that 

Jkj = {p + p) {U^ (Vfe V, Uj - V, V, Uk) - Uj Vfe V, [/* (4.51) 

+1/2 Uk V, V, W - i/2 Vz C/' (Vfc Uj - V, Uk) + Vfc U^ V, Uj - Vj W V, Uk 

+e ^ (0)« Vz U^ {Uk W V, Uj - Uj W V, C/fc)| 

+{a Uk U^ V, Uk) Sj - (a U, V,U^-j3 W V., U,) szo. 
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where we set 




In particular, 




+ 



f^(S) 

\ o 




T~< c 7-1 0 ^ pc „ 

^ c 0 ^ 0 a , ^ c 0 ? 

p + p 



and 




From this follows that the last two equations of (4.49) are given by 



The remaining equations of (4.49) and (4.50) (besides Pij = 0, Qki = 0) 
again only contain derivatives in the direction of U. 

The Derivation of the Subsidiary Equations We have to show that any 
solution to the reduced equations which satisfies the constraints on an initial 
hypersurface, i.e. for which z = 0 on the initial hypersurface, will satisfy z = 0 
in the domain of dependence of the initial hypersurface with respect to the 
metric supplied by the solution. For this purpose we will derive a system of 
partial differential equations for those components of z which do not vanish 
already because of the reduced equations and the gauge conditions. 

We begin by deriving equations for There exist two different expres- 
sions for Fki = V-1 Fjki- From the definition of Fjki and from the symmetries 
of the tensor field involved follows 




(4.54) 





iz'^dtFa\-iz^eb{Fo\) = iy^ {Fk\{FoK-Fa\) (4.55) 

-Fo\Fa^, + R° b0a + /^0°c(/^a%-n^) 
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Fki = V^' V* = V^' V* C,jki - V^' V[fe 5,],- 

= -RP [, c,]pM + RP fc] + \r,p,Vp M 

+ V [fe V^' Si^,- RP [i ^ fc] Sp, -RPj^ [fc + Vp [I Tk] P J , 

where we took into account that we do not know at this stage whether the 
connection coefficients Fi^ k supplied by the solution define a torsion free 
connection. From the reduced field equations, the definition of the zero quan- 
tities, and the symmetries of Cijki, it follows that 

Fki = -AP + C^Mi + 0 Vp ki (4.56) 

+K Jki + AP fc] Sp, - AP j ^ [fe = N{z), 

where N{z) is, as in the following, a generic symbol for a smooth function 
(which may change from equation to equation) of the zero quantities which 
satisfies fV(0) = 0. 

On the other hand, because of the reduced equations, equation (4.47) 
takes the form 

^jki — 2 Uj P^k F hj^k Pi] F Qi {Pj ^ ki c j^k Pi]) 2 P^^j[k Qi]- (4-57) 

Contracting with V^ , decomposing the resulting expression into F'j^i and 
equating with the corresponding expressions obtained from (4.56), we arrive 
at equations of the form 



Cu Pk F^ek V, Q, = N{z) (4.58) 

CuQk-\ek^^V,P^=N{z). (4.59) 

The connection defined by the Fi^ k and the associated torsion and cur- 
vature tensors satisfy the first Bianchi identity 

J2^jTkP=Y.(^'jkiFT,"^kTPm), 

(jkl) (jkl) 

where J2{jki) denotes the sum over the cyclic permutation of the indices 
jkl. Setting here j = 0, observing that the symmetries of jki, ^ki imply 
’^{jki) jkl = J2(jki) jkl, and taking into account the reduced equations, 
we get from this an equation of the form 



eo{Ta\) = N{z). 



(4.60) 
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To derive equations for jki, we use the second Bianchi identity 

^ V, mkl = - ^ mnj I- 



mkl — 



mnj 



(4.61) 



Uki) 



Ukl) 



We write 



R'‘ jki — A^ jki + C* jki + if* jki + G* jki, 
with S'- jki = g* [fc \ G" jki = I S [kgi]j and S*^ = Sjk-\Sgjk- 



Using the well known facts that the left and right duals of Cijki and Gijki 
are equal while the left dual of Eijki differs from its right dual by a sign, and 
using the reduced equations, we get 



i) In the case of pressure free matter we get from (4.62), by setting j = 0 and 
using (4.48), an equation of the form 



Since, as remarked earlier, = 0 by our gauge conditions, the system of 
equations consisting of (4.58), (4.59), (4.60), and (4.63) constitutes the de- 
sired ‘subsidiary system’ for the zero quantities in the pressure free case. 

ii) Using (4.49), we get in the general case by the analogous procedure 
only an equation of the form 



We can express the equations for Z\° [b|o|c]j A^ abc- in terms of the quantities 



^ ) Vj mkl — ^ ^ j ^ mkl 



(jki) (jki) 



Ukl) 




ITS 



— ^ ^ j ^ mkl 4” 2 ^jkl ^ K gpj- (7s) Cjji 

Ukl) 

whence, by (4.57) and (4.61), 



irs 




(4.62) 



Ukl) 



eo(W kab) = N{z). 



(4.63) 



eo(2l“ bed) = N{z). 



(4.64) 



By the antisymmetry of J'j, we know already that 

A'^ aOb + A'^ bOa = 0. 




0ac5 



(4.65) 
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because 

[b|0|c] = -‘^a e“ ho -4° abc = ~ ^*ad ^bc ‘^ + 2 ha[b ^*] • 

From (4.62) we get 

eo abc) + Cc (2\° aOb) + Cb (-4° aco) = ^ {z) ^ 

which implies for the quantities (4.65) equations 

2eo(z4:)-e„ et, {A^) = N {z) , 

and 

eo {Al^) - e(„ {Ad)) + hab {A^) = N{z). 

Since we have 

e, (Z\b) = ea{^JL) = ~l^ V. V, qk+N{z) = N{z), 

2 P + P 2 p + p 

we can write the second equation in the form 

eo{A:d)-e^a{Ad)) = N{z). 

From (4.62) we get furthermore 

Ca {A^ bed) + 6d (2\^^ hoc) + 6c (2\° fc^ja) = N{z), 

which implies in terms of the quantities (4.65) an equation of the form 

(AD-ea^^^CaiAl). 

By a direct calculation we derive from (4.31) the equation 

2 Cu J'j = 4 - h, P h, « {A^ apq + A^ qnp + A^ pqn) qi 

-h,Ph,<l{Ti^pVn qg+T,^lVnqp + Tp^q^nqi)U^ 

—2 Qi Jjjj + 2 Qj Jm, 

which can be rewritten by (4.49) in the form 

{p + p) {eo {Aa) + 2iz^€a 6b (^^} = ^ i^) ■ (4-66) 

From the equations above we obtain the system 

(p + p) (2 1/2 eo {AD - z/2 ea Cb {A^)} = N{z), (4.67) 

(p + p) {eo {Aa) + 1/2 6a Cf, (Z\*) + J/2 6b {A^)} = N {z) , (4.68) 

Cab {p + p) {v'^eo {Alb) - (^b))} = N{z), (4.69) 
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where Cab = 1 if a = 6 and Cab = 2 if a ^ 6. Finally, we obtain from (4.31), 
(4.49) the equation 

£(7(?a = 2j/^(p + p)Z\^0ac- (4.70) 

Equations (4.58), (4.59), (4.60), (4.64), (4.67), (4.68), (4.69), and (4.70) con- 
stitute the subsidiary equations for the zero quantities in the general case. 

We note here that in the reduced system it has not been built in explic- 
itly that the energy-momentum tensor has vanishing divergence (cf. equation 
(4.30)). While we assumed the equation g = 0 as part of the reduced equa- 
tions, we verify the vanishing of the quantity by deriving the subsidiary 
equations and using the uniqueness property for these equations. 

In the present formalism the gauge conditions are taken care of by the 
explicit form of some of the unknowns, however the list of constraints is 
much longer than in the previous discussions. We shall not try to demon- 
strate that the constraints are preserved in the specific case of ‘floating fluid 
balls’.. Though the construction of data for fluid balls of compact support 
which are embedded in asymptotically vacuum data has been shown and 
their smoothness properties near the boundary have been discussed [70], the 
evolution in time of these data and the precise smoothness properties of the 
fields near and possible jumps travelling along the boundary have not been 
worked out yet. However, without a precise understanding of the behaviour 
of the solution near the boundary the conservation of the constraints cannot 
be demonstrated. 

Our reduced system also found applications in cosmological context where 
the fluid is spread out, with p + p > 0, over the time slices (cf. [31], [79]). 
In this case the desired conclusion follows from the fact that the subsidiary 
systems are symmetric hyperbolic, have right hand sides of the form N{z), 
and the characteristics of the reduced system and the subsidiary system are as 
follows (where we use only the frame components k of the covector 

0 - 

(i) In the case of pressure free matter the characteristic polynomial of the 
reduced system is of the form 

with positive integers K, L, N and constant factor c, while the characteristic 
polynomial of the subsidiary system only contains the first two factors. Thus 
the characteristics of the subsidiary system are timelike with respect to 
(cf. also the remarks in Sect. 2). 

(ii) In the general case the characteristic polynomial of the reduced equa- 
tions is of the form 

C (^o)"^ (^o" + J Ca 6)^ {^0 + {g^'' Cm (4-71) 

with positive integers K, L, M, N and constant factor c, while the character- 
istic polynomial of the subsidiary system is generated by powers of the first 
three factors. 



The Cauchy Problem 185 



We note that the equations (4.54) and (4.55) contribute to the principal 
symbol the third factor which corresponds to the sound cone pertaining to the 
fluid. If one wants to ensure that the sound does not travel with superluminal 
speed one has to require the equation of state (4.35) to be such that < 1. 
For the question of existence and uniqueness of solutions there is no need 
to impose such a condition, but ii v > 1, the domain of dependence with 
respect to as it has been deflned in Sect. 2, cannot be shown any longer, 
by the arguments given in Sect. 2, to be also a domain of uniqueness. In any 
case, it follows from the characteristic polynomials that in a domain where 
the solution of the reduced system is unique according to those arguments, 
the constraints will be satisfied if they hold on the initial hypersurface. 

We note that the system simplifies considerably in the isentropic case. In 
the reduced system the function p+p then neither occurs in the principal part 
nor in a denominator. In the case of the subsidiary system a more detailed 
discussion is required to understand the consequences of the occurrence of 
the various factors p + p- 

In our procedure the fluid equations serve two purposes, they determine 
the motion of the fluid as well as the evolution of the frame. If we set k = 0 
in all equations the fluid equations decouple from the geometric equations 
and we obtain a new hyperbolic reduction of the vacuum held equations. In 
this procedure any exotic ‘equation of state’ may be prescribed as long as it 
ensures a useful, long-lived gauge. 

If the initial data for the Einstein-Euler equations are such that [/, p, p, 
n, s, and the equation of state can be smoothly extended through the bound- 
aries of the fluid balls, this suggests using the ‘extended fluid’ to control the 
evolution of the gauge in the vacuum part of the solution near the boundary. 

4.3 The Initial Boundary Value Problem 

In the previous section we studied a problem involving a distinguished time- 
like hypersurface. Its evolution in time was determined by a physical pro- 
cess. There are also important problems where the Einstein equations are 
solved near timelike hypersurfaces which are prescribed for practical reasons, 
e.g. to perform numerical calculations on finite grids. The underlying initial 
boundary value problem for Einstein’s held equations, where initial data are 
prescribed on a (spacelike) hypersurface S and boundary data a (timelike) 
boundary T which intersect at a 2-surface E = T D S, has been analysed in 
detail in the article [42]. The solution to this problem requires a hyperbolic 
reduction which needs to satisfy, beyond the conditions discussed at the be- 
ginning of this section, certain side conditions. In the following we want to 
comment on those aspects of the work in [42] which illustrate the flexibility 
of the held equations in performing reductions and on certain characteris- 
tics of the reduced system. For the full analysis of the initial boundary value 
problem we refer to [42]. 
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Since we are dealing with a problem for equations which are essentially 
hyperbolic, the problem can be localized. In suitably adapted coordinates x^, 
defined on some neighbourhood of a point p £ S, the manifold M on which 
the solution is to be determined will then be given in the form M = {x € 
> 0,a;^ > 0}, the initial hypersurface by S' = {a; € M\x^ = 0} and the 
boundary by T = {a: £ M\x^ = 0}. Clearly, we will have to prescribe Cauchy 
data on S as before, we will have to analyse which kind of boundary data are 
admitted by the equations, and on the edge S = {x & M\x^ = 0, x^ = 0} 
the data will have to satisfy some consistency conditions, as is always the 
case in initial boundary value problems. 

Maximally Dissipative Initial Boundary Value Problems We have 
seen in Sect. 3.1 that energy estimates provide a basic tool for obtaining 
results about the existence and uniqueness of solutions to symmetric hyper- 
bolic systems. To explain the side conditions which have to be satisfied in a 
hyperbolic reduction of an initial boundary value problem for Einstein’s field 
equations, we consider what will happen if we try to obtain energy estimates 
in the present situation. Assume that we are given on M in the coordinates 
x^ a linear symmetric hyperbolic system of the form 

A^^^^u = Bu + f{x), (4.72) 

for an R^-valued unknown u. The matrices = A^{x), p = 0, 1,2,3, are 
smooth functions on M which take values in the set of symmetric N x N- 
matrices, there exists a 1-form such that is positive definite, B = 

B{x) is a smooth matrix-valued function and f{x) a smooth R^-valued func- 
tion on M . For convenience we assume that the positivity condition is satisfied 
with = S° f,. 

If we assume that u vanishes for large positive values of x“, a = 1,2,3, 
and if the relation 

d^(*u A^ u) = K u + f with K = B+*B + dfj,A^, 

implied by (4.72), is integrated over a set Mt = {a: £ M|0 < x** < r}, defined 
by some number r > 0, we obtain the relation 

f *uA^udS= f *uA'^ udS + ( \*u K u + 2^u f} dV + f ^uA^udS, 

JS-r JS J JT-r 

involving boundary integrals over S',- = {x £ M|x° = r} and Tt = {x ^ 
M|0 < x° < r, x^ = 0}. Obviously, the structure of the normal matrix A^ 
plays a prominent role here. If the last term on the right hand side is non- 
positive, we can use the equation above to obtain energy estimates for proving 
the existence and uniqueness of solutions. 

By this (and certain considerations which will become clear when we have 
set up our reduced system) we are led to consider the following maximally 
dissipative boundary value problem. 
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We choose g G C°°{S,R^) and require as initial condition u{x) = g{x) 
for X £ S. We choose a smooth map Q of T into the set of linear subspaces 
of and require as boundary condition u{x) G Q{x) for x £ T. The type 
of map Q admitted here is restricted by the following assumptions. 

(i) The set T is a characteristic of (4.72) of constant multiplicity, i.e. 

dim(ker7l^(a:)) = const. >0, x £T. 

(ii) The map Q is chosen such as to ensure the desired non-positivity 

^uA?{x)u<ff, u£Q(x), X £ T. 



(iii) The subspace Q(x), x £ T, is a maximal with (ii), i.e. the dimen- 
sion of Q(x) is equal to number of non-positive eigenvalues of counting 
multiplicity. 

The specification of Q can be expressed in terms of linear equations. 
Since is symmetric, we can assume, possibly after a transformation of the 
dependent variable, that at a given point x £ T 



A^ = K 



-Ij 0 0 

0 Ofc 0 
0 0 



K > 0, 



where Ij is a j x j unit matrix, Ofc is a fc x fc zero matrix etc. and j + k + l = N. 
Writing u = *(a, b, c) G x x we find that at x the linear subspaces 
admitted as values of Q are neccessarily given by equations of the form 0 = 
c — H a where H = H{x) is a I x j matrix satisfying 

-*aa+ Ha<0, a£W, i.e. ^HH<Ij. 



We note that there is no freedom to prescribe data for the component b of u 
associated with the kernel of A^. More specifically, if = 0 on T, energy es- 
timates are obtained without imposing conditions on T and the solutions are 
determined uniquely by the initial condition on S. By subtracting a suitable 
smooth function from u and redefining the function /, we can convert the 
homogeneous problem above to an inhomogeneous problem and vice versa. 
Inhomogeneous maximal dissipative boundary conditions are of the form 



q = c-Ha, (4.73) 

with q = g(x), x £T, a given Revalued function representing the free bound- 
ary data on T. 

Maximally dissipative boundary value problems as outlined above have 
been worked out in detail in [74], [83] for the linear case and in [48] [84] for 
quasi-linear problems (see also these articles for further references) . If we want 
to make use of this theory to analyse the initial boundary value problem for 
Einstein’s field equations we will have to solve two problems which go beyond 
what is known from the standard Cauchy problem. 
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(i) We will have to find a reduction, involving a symmetric hyperbolic 
system, which gives us sufficient information on the normal matrix so that 
we can control the conditions above. 

(ii) To demonstrate the preservation of the constraints we will have to 
discuss an initial boundary value problem for the subsidiary system. This 
should be such as to admit a uniqueness proof. Moreover, there is the problem 
of getting sufficient control on the solution near T. While the initial data for 
the reduced system will of course be arranged such that the constraints are 
satisfied on the initial hypersurface S, it will a priori not be clear that we will 
have sufficient information on the behaviour of the solution to the reduced 
equations and on the data on T in order to conclude that the constraints will 
be satisfied on T. 

The choice of representation of the field equations, of the gauge conditions 
and the gauge source functions, and, in particular, the choice of the reduced 
equations will largely be dominated by the second problem. 



The Representation of the Einstein Eqnations In [42] the initial bound- 
ary value problem for Einstein’s vacuum field equations was analysed in terms 
of the equations 

T,'=,=0, A^jki = 0, (4.74) 

of the previous section with everywhere vanishing energy- momentum tensor. 
We shall use these equations together with the conventions and notation 
introduced in the previous section. 



The Gange Conditions The gauge, which we assume here for simplicity 
extends to all of M, has been chosen as follows. On the initial hypersurface 

= 0 and x“, a = 1,2,3, are coordinates with = 0 on S and > 0 
elsewhere. The timelike unit vector field cq on M is tangent to T, orthogonal 
to the 2-surfaces Sc = {x^ = c = const. > 0} in S, and it points towards M 
on S' n C/. The coordinates x^ satisfy o = eo(x^) = o on M and the sets 
Tc = {x^ = c} are smooth timelike hypersurfaces of M with Tq = T. The unit 
vector field 63 is normal to the hypersurfaces Tc and points towards M on T. 
The vector fields ca, A= 1,2, are tangent to Tc fl S and such that they form 
with Co, 63 a smooth orthonormal frame field on S. On the hypersurfaces Tc 
these fields are Fermi transported in the direction of eg with respect to the 
Levi“Civita connection D defined by the metric induced on Tc- The form 
a smooth orthonormal frame field on U. We refer this type of gauge as an 
‘adapted gauge’. Notice that it leaves a freedom to choose the timelike vector 
field eo on M \ S. 

In analysing the initial boundary value problem it will be necessary to 
distinguish between interior equations on the submanifolds S, T, Tc, S, Sc- 
Since our frame is adapted to these submanifolds, this can be done by dis- 
tinguishing four groups of indices. They are given, together with the values 
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they take, as follows 

a, c, d, e, / = 0, 1, 2; i, j, k, l,m,n = 0, 1, 2, 3; 

p,q,r,s,t = 1,2,3; A, B,C,D = 1,2. 

We assume the summation convention for each group. 

By our conditions the frame coefficients k satisfy 

= e3„ = 0, e%>0 on M, (4.75) 

while the frame connection coefficients satisfy 

-To ^ B = 0. (4.76) 

The fields Ca satisfy on Tc the equations 

Deo eo = 0 CA, DeoCA = —gAB Pq^ 0^0- (4.77) 

Thus, given the hypersurfaces Tc, the evolution of the coordinates x°‘, a = 

0,1,2, and the frame vector fields off S is governed by the coefficients 

r ^ 

^ 0 0 - 

Another part of the connection coefficients defines the intrinsic connec- 
tion D on Tc, since DaCc = = Pa^ c^b- The remaining connection 

coefficients, given by 

Xab = d(Ve„ 63, Cfo) = Pa^ 3 9jb = Pa^ b = P {a ^ b), (4.78) 

define the second fundamental form of the hypersurfaces Tc in the frame Bq. 
In the reduced equations, the symmetry of Xab has to be taken into account 
explicitly. A special role is played by mean extrinsic curvature 

X = g^^Xab = g^'^Pj\ = y^e^^3, (4.79) 

since it can be regarded as the quantity controlling the evolution of the hy- 
persurfaces Tc and thus of the coordinate x^. 

We now choose two smooth functions G C°°{M) as gauge source func- 
tions. These will occur explicitly in the reduced equations and will play the 
role of connection coefficients for the solution, namely = Pq^ o- Further- 
more we will choose a function / G C°°{M) which will play the role of the 
mean extrinsic curvature on the hypersurfaces Tc- Here the interpretation is 
somewhat more complicated. On T the function X = /It must be regarded 
as the free datum which, together with certain data on S, indirectly spec- 
ifies the boundary T. However, for > 0 the function / plays the role of 
a gauge source function which determines the gauge dependent hypersurface 
Tc, c > 0. It is a remarkable feature of the Codazzi equations that they admit 
this freedom while at the same time implying hyperbolic equations. 

This example clearly shows the importance of the freedom to dispose of 
the gauge source functions. While we could choose F^ = 0 locally (cf. the 
remarks in [42] about certain subtleties arising here), we need the full freedom 
to make use of the gauge source functions /, since otherwise we could only 
handle restricted types of boundaries. 



190 



Helmut Friedrich and Alan Rendall 



The Reduced Equations Using the gauge conditions above, we extract 
from (4.74) the following reduced system for those components of the un- 
knowns k, Eij, Bki which are not determined already by the gauge 

conditions and the chosen gauge source functions. Where it has not already 
been done explicitly, it is understood that in the following equations the con- 
nection coefficients To o and x are replaced by the gauge source functions 
and / respectively. The torsion free condition gives 

0 = -To%e^fc = dte%-(To%-/;%)e%-ro%(5%. (4.80) 

The Gauss equations with respect to provide the equations 

0 = ooA = eoiFA ^ o) - eA{F^) + ^ o ^ o (4.81) 

—Fa ^ c + E^ F'" gAc + Xo ^ Xao ~ Xa^ Xoo ~ qoa, 

0 = COA = eo{FA ^ c) + E^ Fa^ c + Ea^ oE’^ geo (4.82) 

+Ed ^ c Ea^ 0 + Xo^ Xac - Xa^ Xoc - coa- 
Codazzi’s equations with respect to imply 



0 = 


gab ^3 


abl 


= Dq xoi — Di 


Xii ~ D 2 X 12 — 


Diif), 


(4.83) 


0 = 


gab ^3 


ab2 


= Do X 02 — Di 


X 12 — D 2 X 22 — 


772(/), 


(4.84) 




0 = 


A^ 


101 = Do Xii — 


77i Xoi — 101 


1 


(4.85) 


+ ZS 


102 = 


■ 2 71q X 12 — Di X 02 


— D 2 Xoi — : 


201 — 102 , 


(4.86) 




0 = 




202 — Dq X 22 — 


D 2 X 02 — 202 


1 


(4.87) 



where it is understood that the component xooi which appears only in un- 
differentiated form, is given by xoo = Xii + X 22 + /• The remaining Ricci 
identities give 

0 = A^ bo3 = 60(73 b) + E"^ Es° b + Es"^ 0 E^ gBC + 7c b T 3 0 (4.88) 
+73 "^s7l3^o + Xo"^7^3^B — T^3"^3 Xob ~ Ec^ b Xo^ ~ b03, 

0 = A"^ 003 = 60(73 0 ) — 63 (F"^) -I- xo 7a ^ 0 — 7a ^ b E^ + Eb^ 0 E^^ q 

(4.89) 



The Cauchy Problem 191 



ipA p 3 A B p 3 ri^A t^A pfA 

+ I3 0I3 0~^B 0X0 —J^3 B9 XOO — T XOO — 003, 



0 — A03 + 03A — eo(/3 ^ a) — S-A^r^ ^ o) (4.90) 

+A ^ 0 9ba + ^3 ^ c ^a 0, 



0 = Z\3 ^ 3 ^ gAB 3 (4 9^) 

^ab 3 T' k I 3 t' k i 7~i3/7~'m m \ 

—9 J-3 k^b a-\-g J-b k^3 a 9 ^ m a\J~ 3 b ~ J~ b 3j- 



In the previous section we saw how to extract a symmetric hyperbolic 
system from the Bianchi identities. However, for reasons given below, we 
shall not choose that system here. Instead we choose the ‘boundary adapted 



system’ 

Pii — P 22 = 0 

2Pi 2 = 0 
Pii + P 22 = 0 

^*13 = \Q2 
P 23 = —^Qi 



Qii — Q 22 — 0 

2Qi2 = 0 

Qii + Q22 = 0 (4.92) 

C13 = ~\P2 
Q 23 = \Pl^ 



written as a system for the unknown vector u which is the transpose of 



((P-, 2Pi2, P+, P13, P23), (P-, 2Pi2, P+, 



P13, ^23))- 



Here E± = En ± E 22 , and B± = Bn ± B 22 and it is understood that the 
relations E^ = 0 and g®-l Bij = 0 are used everywhere to replace the fields 
and P.3.3 by our unknowns. Written out explicitly, this system takes the 
form (P + u = &, with 



= 



0 

0 ’ 



= 



0 Af^' 
TAt^ 0 ’ 



where 



'1 


0 


0 


0 


o' 




0 




0 


e^2 


e^i ■ 


0 


1 


0 


0 


0 




e>^3 


0 


0 


-e'^i 


e^2 


0 


0 


1 


0 


0 


, = 


0 


0 


0 


e^2 


-e^i 


0 


0 


0 


1 


0 




-e'^2 


e'^i 




0 


0 


1 

0 


0 


0 


0 


1 




-e'^i 


-e'^2 


e'^i 


0 


0 



(4.93) 



The reduced system consisting of (4.80) to (4.92), is symmetric hyperbolic. 
However, beyond that the choice of this particular system was motivated by 
the following specific features. 

(i) The theory of maximally dissipative initial value problems applies to 
our reduced equations. In equations (4.80) to (4.91) the derivative which 
by our gauge conditions occurs only with the directional derivative 63, is ap- 
plied to the gauge source functions but not to the unknowns, while (4.93) 
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shows that we have perfect control on the non-trivial part of the normal ma- 
trix arising from (4.92). Our discussion of maximally dissipative initial value 
problems, which led to (4.73), and the form of the matrices (4.93) suggest 
that we can prescribe besides the datum which characterizes the boundary, 
precisely two free functions as boundary data on T. This is confirmed by the 
detailed discussion in [42], though in general a number of technical details 
have to be taken care of . 

(ii) If instead of (4.92) we had chosen the system Pij = 0, Qki = 0 as 
equations for the electric and magnetic part of the conformal Weyl tensor, 
the theory of maximally dissipative initial value problems would also have 
applied. We would, however, have come to the conclusion that besides the 
mean extrinsic curvature four functions could be prescribed freely on T. This 
apparent contradiction is resolved when one tries to show the preservation 
of the constraints, i.e. that those equations contained in (4.74) are satisfied 
which are not already solved because of the gauge conditions and the reduced 
equations. In the case of the reduced equations above this can be shown for 
the following reason. The subsidiary system splits in this case into a hierachy 
of symmetric hyperbolic subsystems with the following property. The first 
subsystem has vanishing normal matrix on T. This implies under suitable 
assumptions on the domain of the solution to the reduced equations that all 
unknows in this subsystem must vanish, because the data on S are of course 
arranged such that all constraints are satisfied. Furthermore, it follows for any 
subsystem in the hierarchy that its normal matrix vanishes if the unknowns 
of all previous subsystems in the hierarchy vanish. From this the desired 
conclusion follows in a finite number of steps. If we had considered instead 
the system Pij = 0, Qki = 0, the discussion whether the constraints are 
preserved would have become quite complicated and would have led us in 
the end to the conclusion that only two functions are really free on T while 
the others are subject to restrictions determined by the evolution properties 
of the reduced system. 

We end our discussion of the initial boundary value problem with an 
observation about the characteristics of the reduced system. Equations (4.80) 
to (4.91) contribute a factor of the form 

(72 (72\Z/ / r % <72 ^2 t ‘ 2 \M 

SO lS0~Sl~S2J l^S0~Sl~S2J 5 

to the characteristic polynomial (using again only the frame components of 
the covector ^). The corresponding characteristics are timelike or null with 
respect to g^,j. However, the subsystem (4.92) contributes a factor 

fo" (^ 0 ^ - e? - - 2^1 - ?|)^. 

If we denote by the 1-forms dual to the vector fields e^, so that < (j\ek > 
= 6^ k and ~ <^jj, the characteristics associated 

with the third factor in the polynomial above can be described as the null 
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hypersurfaces with respect to the metric 



7 00 ^ 11^22 



3 ^3 

/i ^v 



(4.94) 



4.4 The Einstein Dirac System 

Apparently, not much has been shown so far about the existence of solutions 
to the Einstein-Dirac system for general data. The initial value problem for 
this system was considered in [9] , but no existence theorem for the evolution 
equations was proved there. J. Isenberg has suggested to us that it should 
be possible to show the well-posedness of the equations by formulating the 
equations as a system of wave equations in a way similar to what was done 
for the Cauchy problem for classical supergravity in [8]. However, this idea 
has not been worked out in the literature. 

We shall indicate here how to obtain symmetric hyperbolic evolution equa- 
tions from the Einstein-Dirac system. To avoid lengthy calculations, we shall 
not discuss the complete reduction procedure but only use this system to 
illustrate certain questions arising in the reduction. 

We shall write the equations in terms of the 2-component spin frame for- 
malism, which may be thought of as the spinor version of the frame formalism 
used in the previous sections. The fields and the equations will be expressed 
in terms of a spin frame {ia}a=o,ij which is normalized with respect to the 
antisymmetric bilinear form e in the sense that it satisfies tab = e(ta, ^b) with 
£oi = 1. The associated double null frame is given by Caa' = ta ta', it satisfies 

^aa' — ^aa' and (]{Caa' •, ^bb'^ — ^ab^a'b' 

All spinor fields (with the possible exception of the basic spin frame itself 
and the vector fields Boo') will be given with respect to the spin frame above 
and we shall use Cab and e“^, defined by the requirement (the 

Kronecker symbol), to move indices according to the rule cub, bJa = 

f-ba- 

We use the covariant derivative operator V acting on spinors, which is 
derived from the Levi-Civita connection of g and satisfies V Cab = 0, to 
define connection coefficients Faa'bc = Fg^a'^bc) by 

^ aa' ^b ^ ^aa' ^aa' b ^c- 

For any spinor field o;“ we have 

{Vcc' Vrfd' — Vdrf/ Vcc')^^ = bcc'dd' — Tec' dd' 

with vanishing torsion 

0 — Tjjfjf cc' ^dd' — ^ bb' ^cc' ^ cc' ^bb' t ^cc'] (4.95) 

— Tbb' c^dc' T})})! c' ^cd' Tqc' b^db' Tqc' b' ^bd' 
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and curvature spinor field 

Rabcc' dd' — ^dd' {^cc' ah') ^cc' i^^dd' ah') “t“ ^dd'ae ^cc' b (4.96) 

~^^ed' ab ^cc' d ^cc* ae ^dd' b ^ec'ab ^dd' c 

j_7~' 7^ e' e' rp ee' p 

1 -^ de'ab ^ cc' d' ^ ce'ab ^ dd' c' -^cc' dd' ^ ee'ab’ 

The latter has the decomposition 

Rabcc'dd'— ^abcd^c'd' ^abc'd'^cd~^ ^c' d' {^bd ^ac ^ad ^bc) : (4.97) 

into the conformal Weyl spinor field 'I'abcd = 'l^{abcd) as well as the Ricci spinor 
'^aba’b' = ^(ab)(a'b') = ^aba'b' and the Scalar A, which allow us to represent 
the Ricci tensor in the form 



^aa'bb' — 2 ^aba'b' “t“ 6 rl Cab ^a'b' ■ 



The Bianchi identity reads 

o' 'I'abcf = V(o ^bc)a’f- (4.98) 

The Field Equations The Einstein-Dirac system is specified (cf. [72]) by 
a pair of 2-spinor fields 4>a, Xa' satisying the Dirac equations 

V“o'«f>a = MXa', Vo“'xo' = (4.99) 

with a real constant /r, and the Einstein equations with energy-momentum 
tensor 

Taa'bb' — bb' 4^ a' ^ bb'4^a' ~t” ^’b ^ aa' 4^h’ 4^b' ^ aa' 4^b (4. 100) 

Xa ^ bb'Xa' 4“ Xa' ^ hb'Xa Xb ^ aa'Xb' 4“ Xb' ^ aa' Xb} • 

The Einstein equations then take the form 

A=--^{4>ar-~4>a'X^'), (4.101) 

^aba'b' 2 ^4^{a ^ b){a' 4*b') 4^{a' ^ b'){a4^b) (4.102) 

X{a ^ b){a'Xb') 4“ X{a' ^b')(aXb)}. 

The discussion of this system is complicated by the fact that the Dirac 
equations are of first order while derivatives of the spinor fields also appear 
on the right hand side of (4.102). Consequently, the right hand side of the 
Bianchi identity (4.98) is given by an expression involving the derivatives of 
the spinor fields from zeroth to second order. Therefore we need to derive 
equations for these quantities as well. 
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By taking derivatives of the Dirac equations and commuting derivatives, 
we obtain 

V“ a'’^bb'4'a = ~R^ a “ a'bb' <Ph + fJ-'^bb’Xa' (4.103) 

V“ cc''^ bb' 4>a = —S/ cc' a “ a'bb' 4’h (4.104) 

R a a'bb' ^cc'4^h 4" bb'^ cc'Xa' R b a' cc' ^hb'^a 
— R^ b' “ a' cc' '^bh'4'a ~ R!^ a' cc' '^bb'(t^h 

and similar equations for the derivatives of \a' ■ It is important here that the 
curvature quantity 



i?" 



a'bb' 



= — da'b' — 3/1 ea'b'f-d ^ 



which occurs in these equations does not contain the conformal Weyl spinor. 
We can use (4.101) and (4.102) to express R^ a^" a'bb' and its derivative in 
(4.103) and (4.104) in terms of the spinor fields and their derivatives to 
obtain a complete system of equations for cj)a, ’^bb"Pa, ^ cc'^bb'4>a and the 
corresponding fields derived from Xa' ■ 

These fields are not quite independent of each other. If we define sym- 
metric fields cj)aca', 4>abca'b', Xaa'c', Xaba'b'c' by Setting 



(f}ac°'' = V(a“Vc), 
we get from the Dirac equations 



^abc^'^' =y (a 

Xab =V(, 



^ aa' 4^b 



2 



^ab Xa ' : 



1 ^2 ^ 

^ cc'^bb'4'a — 4^abcb'c' ^ ^b'c' ^abch 4' 4“ ^ ^c(a ^b)hb'c' 4 

2 1 2 

4”2 yl 4{a^b)c ^b'c' 4“ ~ ir (ia{b Xc)b'c' 2 ^ ^b'c ' ; 



and similar relations for the derivatives of Xa'- From these, the equations 
above, and (4.98), (4.101) and (4.102) we can derive equations of the form 



V a'4abb' — ^ba'b't ^ a'4abcb'c' ~ ^bca'b'c'J (4.105) 

Va “ Xba'b' = N^bb'^ “ Xbca'b'c' = ^abcb'c'^ (4.106) 

where denote functions of 4a, Xa', 4aca', Xaa'c', while 

^abcb'c' depend in addition on 4abca'b', Xaba'b'c', and ^abcd- Note that this 
introduces (or rather makes explicit) further non-linearities. 
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Hyperbolic Equations from the Einstein Dirac System Our field 
equations for the unknowns 

k: ^aa'bci ^abcd^ ^abb' : 4^abcb'c'i ?Ca' j \ba'b' i \bca'b'c'^ 

are now given by (4.95) and (4.97) (with the left hand side understood as 
being given by (4.96)), (4.98), (4.99), (4.105) and (4.106). Here Einstein’s 
equations (4.101) and (4.102) are used to express quantities derived from the 
Ricci tensor in terms of the spinor fields and their derivatives. 

When we try to deduce a hyperbolic reduced system from these equations, 
the first two equations, which determine the gauge dependent quantities, will 
require the choice of a gauge, while we expect the remaining equations, which 
are tensorial, to contain subsystems which are hyperbolic irrespective of any 
gauge. This is indeed the case and there are, due to the fact that most of the 
equations are over determined, various possibilites to extract such systems. 

In [36] sytems of spinor equations have been considered which are built 
from systems of the type 



^ a ' — ^a' p ^ '4^ cj') 1 

or their complex conjugates, where (3 denotes a multi-index of some sort. 
If the components corresponding to different values of the indices b, /3 are 
independent of each other the equations 

— 0 ' i’bp = —Fq'p 

1 ' 'ipbp = Fi’p 

form a symmetric hyperbolic system. Equations (4.99) are thus symmetric 
hyperbolic as they stand. If symmetries are present which relate the index 
6 to a group of unprimed spinor indices comprised by /3, the equations to 
be extracted are slightly different. For instance, we obtain from (4.98) a 
symmetric hyperbolic system (regarding all fields besides the Weyl spinor 
field as given) of form 

u 'f'ooo/ = • ■ ■ , 

V Fablf — Q/ FabOf = ■ ■ • , 

where the symmetry Fabcd = F(^^bcd) is assumed explicitly so that there are 
five complex unknown functions. Equations (4.105) and (4.106) can be dealt 
with similarly. 

To compare the characteristics of the system above with previous hyper- 
bolic systems extracted from the Bianchi identity, we set 
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62 — — 6io')j 63 — — eii'). 

Then the characteristic polynomial is given up to a positive constant factor 
by 

(4.107) 

which contains the degenerate quadratic form 



fc'^" = 2e'^oe%-e'^ie"i-e'^2e"2- 



The cone {k^'^ = 0} is the product of a 2-dimensional cone in the plane 

3 = 0} with the real line so that its set of generators is diffeomor- 
phic to X R. The associated characteristics are timelike. The special role 
played here by the vector field 63 allows us to adapt the system to situations 
containing a distinguished direction. 

Another method to extract symmetric hyperbolic equations from spinor 
equations has been discussed in [37]. It is based on the space-spinor formalism 
in which an arbitrary normalized timelike vector field is used to express all 
spinor fields and spinor equations in terms of fields and equations containing 
only unprimed indices. If the fields and equations are then decomposed into 
their irreducible parts (a direct, though somewhat lengthy algebraic proce- 
dure), the equations almost automatically decompose into symmetric hyper- 
bolic propagation equations and constraints. 

For simplicity we choose the timelike vector field to be 

'/2eo= Caa' with Taa' = Co “ eo' “ + ei “ ey “ . 

Since Taa' = Cq ^ etc., maps generalizing the map oJa' —>'Ta°' oJa' to 
spinors of arbitrary valence are bijective and allow us to obtain faithful rep- 
resentations of all spinor relations in terms of unprimed spinors. Writing 
Vob = Tfe “ Vaa' = + T^ab, we obtain a representation of the co- 

variant derivative operator in terms of the directional derivative operators 
P = Vaa', 'Dab = T^b‘^ a)a' acting iu the direction of cq and in di- 
rections orthogonal to eg respectively. In particular, (4.98) splits under the 
operations indicated above into ‘constraints’ 

VfS^abfg = ..., 



and ‘evolution equations’ 



P 'Pabcd — 2 I7(a ^ 'Pbcd)f = ■ ■ ■ 

If the latter are multiplied by the binomial coefficients (a^i,^c+d)’ Ibey are 
seen to be symmetric hyperbolic. The characteristic poynomial of this system 
is again of the form 
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however, since there is no privileged spacelike direction singled out here, we 
have a non-degenerate quadratic form 

= (1 + c) e% e% - 1 e" 1 - 2 e" 2 - 3 e" 3, (4. 108) 

with some constant c > 0, as in the case of the system used in the case of the 
Einstein-Euler equations. 

By the method outlined above symmetric hyperbolic systems are also 
obtained for equations (4.105) and (4.106^. For instance, from the first of 
equations (4.105) we obtain for (j)abc = Tc^ 4>abb' = 4>{ab)c an equation of the 
form V“ d 4>abc = ■ • ■, where we only indicate the principal part. Using the 
decomposition 

2 

4^abc 4^abc ^ ^c(a 4^b) with 4^abc 4^{abc)i 4^b ^ i 

and the decomposition of Va&, we get a system of the form 

P + ^abc = 2 ^bac = . . . , 

which is symmetric hyperbolic. 

It is well known that equations for spinor fields of spin y, m > 2 give rise 
to consistency conditions (cf. [72]). For instance, the equation 

^ a' 4^abcb'c' — Hbca'b'd i (4.109) 

where we consider the right hand side as given, implies the relation 

4>abcb'c' +4^abcd'(b' c') = Hada’b’d, 

which reduces e.g. in the case of vanishing right hand side to a particular 
relation between the background curvature and the unknown spinor field. 
Depending on the type of equation and the background space-time, such 
consistency conditions may forbid the existence of any solution at all. Nev- 
ertheless, equation (4.109) implies a symmetric hyperbolic system for which 
the existence of solutions is no problem. Difficulties will arise if one wants to 
show that the constraints implied by (4.109) are preserved. 

This example emphasizes the need to show the preservation of the con- 
straints. Because in our case the right hand sides of the equations are given by 
very specific functions of the unknowns themselves, we can expect to obtain 
useful subsidiary equations. 
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There are again various methods to obtain hyperbolic equations for the 
gauge-dependent frame and connection coefficients, which depend in partic- 
ular on the choice of gauge conditions. In [36], [37] the coordinates and the 
frame field have been subject to wave equations (nonlinear in the case of 
the frame field). Here we shall indicate a gauge considered in [38] (which 
can, of course, also be implemented in the frame formalism considered in the 
previous sections). 

We shall denote by T a ‘time flow vector field’ and by coordinates 
on some neighbourhood of an initial hypersurface S. We assume T to be 
transverse to S, the ‘time coordinate ’ t = to vanish on S, and the relation 
< dx^,T >= 0 to hold on the neighbourhood such that we can write 

T = df 

The frame Caa' is chosen such that the timelike vector field r““ Caa' is 
orthogonal to S. Using the expansion 

^aa' 2 ^ ^cc' ^ a' ^ab with G-ab '^(b ^a)aO 

we can write 



T — Ccc' + Occ' — Cab, 

with 

r“' = 0, /3ab = T(, (4.110) 

Thus the evolution of the coordinates off S is determined by the fields a yf 0 
and /3““ and we can write 

= -(5t-/3“^e,b). (4.111) 

a 

Since we have Vyic = c^b, the evolution of the frame is determined 
by the functions 

rpaa 

^ be — ^ aa'bc' 

and we can write 

t“' raa'bc = - {Fbc - /3“ r^ebc). (4.112) 

a 

with Faebc = T(e T'„)a'bc- 

We now consider the fields a = a{x^) > 0, = /3“^(a;^) (together four 

real functions) as ‘coordinate gauge source functions’ and the field Ibc = 
7bc(a;^) (six real functions) as ‘frame gauge source functions’. This is feasible, 
because given these functions, we can find smooth coordinates and a frame 
fields close to an initial hypersurface such that the given functions assume 
the meaning given to them above. 

The gauge conditions are then expressed by the requirement that the 
right hand sides of (4.111) and (4.112) are given in terms of the gauge source 
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functions and Cab and Faebc- Thus it remains to obtain evolution equations 
for ab and Fo^^bc- 

Reading the quantity aa' for fixed index ^ as the expression of the 
differential of in the frame CoaS we can write (4.95) in the form 

^ aa'^^ bb' ^ bb'^^ aa' — 0- 

Contracting this equation with and ^ and symmetrizing, we obtain 
the equation 

0 = Vt fc,(bVT r,) aa' 

which can be rewritten in the form 

dte^ ab = ■■■, 

where the right hand side can be expressed in terms of the gauge source 
functions and their derivatives and the unknowns. By using (4.97) with (4.96) 
on the left hand side, we can derive in a similar way an equation 



Faebc ■ ■ ■ : 



with the right hand side again being given in terms of the gauge source 
functions and their derivatives and the unknowns. 

Thus we obtain symmetric hyperbolic reduced equations for all unknowns 
except those given by the left hand sides of the gauge conditions (4.111) and 
(4.112). Our procedure applies of course to various other sytems. Our choice 
of gauge is of interest because of the direct relation between the gauge source 
functions and the evolution of the gauge. The causal nature of the evolution 
can be controlled explicitly because the formalism allows us to calculate the 
value of the norm g{T,T) = 2 + Pabl3°'^- This may prove useful if it is 

desired to control the effect of the choice of gauge source functions on the 
long time evolution of the gauge in numerical calculations of space-times. 

4.5 Remarks on the Structure of the Characteristic Set 

We have seen that for certain reduced systems there occur besides the ‘physi- 
cal’ characteristics, given by null hypersurfaces, also characteristics which are 
timelike or spacelike with respect to the metric Timelike characteristics, 
which usually occur if a system of first order is deduced from a system of sec- 
ond order, are usually harmless and of no physical significance. The spacelike 
characteristics, which are partly due to the choice of gauge condition and 
partly due to the use made of the constraints, have no physical significance 
either. Though they are associated with non-causal propagation, there is a 
priori nothing bad about them and it rather depends on the applications one 
wants to make whether they are harmful or not. 
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In the characteristic polynomial (4.71) of the reduced equations for the 
Einstein-Euler system there appears a factor Co + i Cfc which corre- 

sponds to timelike characteristics. In vacuum the corresponding cone has of 
course no physical meaning, since in general there is no preferred timelike 
vector field available. In the perfect fluid case there is a distinguished time- 
like vector held present. This, and perhaps the symmetry of the inner cone 
with respect to the fluid vector, has led some people to speculate on the phys- 
ical significance of that cone [31]. However, in some of the later examples of 
hyperbolic equations deduced from the Bianchi identity, which could also be 
used in the fluid case, the structure of the characteristics is drastically differ- 
ent from the one observed in the fluid case (cf. (4.94) and also the degenerate 
cone arising in (4.107)). In particular, the factor above does not occur in their 
characteristic polynomials. The large arbitrariness in extracting hyperbolic 
equations, which arises from different use made of the constraints implied by 
the Bianchi identity, suggests that in the case of the Einstein-Euler system 
the only ‘physical characteristics’ are those associated with the fluid vector, 
the null cone of and the sound cone. 

The null cone of the metric (4.94) touches the null cone of the metric g 
in the directions of ±63 but it is spacelike in all other directions. Thus all 
null hypersurfaces of it are spacelike or null for Such a cone has the 
effect that the ‘domain of uniqueness’ defined by the techniques discussed in 
Sect. 3.1 may decrease. However, as we have seen, is does not prevent us from 
proving useful results. 

There is also no reason to assume that the additional characteristics nec- 
essarily create problems in numerical calculations. In situations where the 
maximal slicing condition can be used, the occurrence of spacelike character- 
istics which are related, as in our examples, in a rigid way with the metric 
should be innocuous. Also, numerical calculations based on equations with 
inner characteristic cones as observed above have been performed without 
difficulties ([35], [55]). 

5 Local Evolution 

5.1 Local Existence Theorems for the Einstein Equations 

The purpose of this section is to present a local existence theorem for the 
Einstein vacuum equations. By (abstract) vacuum initial data we mean a 
three-dimensional manifold S together with a Riemannian metric h^b and a 
symmetric tensor Xab on S which satisfy the vacuum constraints (see Sect. 2). 
A corresponding solution of the vacuum Einstein equations is a Lorentzian 
metric gajS on a four-dimensional manifold M and an embedding (f> of S into 
M such that hab and Xab coincide with the pull-backs via 4> of the induced 
metric on (j){S) and the second fundamental form of that manifold respectively 
and the Einstein tensor of ga 0 vanishes. If 4>{S) is a Cauchy surface for the 
space-time (M, gaff) then this space-time is said to be a Cauchy development 
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of the data {S,hab,Xab)- The basic local existence theorem for the vacuum 
Einstein equations says that every vacuum initial data set has at least one 
Cauchy development. In fact to make this precise it is necessary to fix the 
differentiability properties which are assumed for the data and demanded of 
the solution. For instance, the result holds if the differentiability class for 
both data and solutions is taken to be C°°. Note that there is no need to 
require any further conditions on the spatial dependence of the data. 

The proof of local existence will now be outlined. We follow essentially the 
original method of [16] except for the fact that we reduce second order equa- 
tions to first order symmetric hyperbolic systems and that we use harmonic 
mappings rather than harmonic coordinates. The use of harmonic mappings, 
as discussed in Sect. 4, allows us to work globally in space even if the man- 
ifold S cannot be covered by a single chart. Using harmonic coordinates it 
would be necessary to construct solutions local in space and time and then 
piece them together. Choose a fixed Lorentz metric on S' x R, for instance 
the metric product of the metric h^b with —dt^. This comparison metric will 
be denoted by Qap. The idea is to look for a solution gai 3 on an open sub- 
set [/ of R. X S such that the identity is a harmonic map from ([/, gais) to 
{U,gaf})- This is a condition which is defined in a global invariant way. Its 
expression in local coordinates is = 0 where F^^ and F^^ 

are the Christoffel symbols of gag and gag respectively. In the terminology 
of Sect. 2.4 this means that we choose g^'^F^^ as a gauge source function. 

Next consider the question of reduction of nonlinear wave equations to 
symmetric hyperbolic form. This is done as follows. Let g°‘^{t,x,u) be func- 
tions of (t,x,u) which for each fixed value of (t,x,u) make up a symmetric 
matrix of Lorentz signature and consider an equation of the form: 



This has been formulated in a local way but a corresponding class of equations 
can be defined in the case that the unknown it is a section of a fibre bundle. 
As in the above treatment of symmetric hyperbolic equations on a manifold, 
consideration will be restricted to the case of sections of a vector bundle V. 
Choose a fixed connection on V. Then the class of equations to be considered 
is obtained by replacing the partial derivatives in the above equation by 
covariant derivatives defined by the given connection. Let Ua = VqU. Then 
the equation (5.1) can be written as: 



g’^^dadgu + F{t, X, it, Du) = 0 



(5.1) 



-5°°Voiio - 2g°^VaUo 



g°-^^aUb + F{t,X,Uo,Ua) 



Volt 



= Uo 



Here is a term involving the curvature of the connection which is of order 
zero in the unknowns of the system. This is a symmetric hyperbolic system 
for the unknowns u and Ua- Since u is allowed to be a section of a vector 
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bundle we are dealing with a system of equations. However the functions 
must be scalars. The appropriate initial data for the second order equation 
consists of the values of u and dtu on the initial hypersurface. From these 
the values of the functions Ua on the initial hypersurface may be determined. 
Thus an initial data set for the symmetric hyperbolic system is obtained. It 
satisfies the additional constraint equation = Ua- Applying the existence 
theory for symmetric hyperbolic systems gives a solution (u,Ua). To show 
that the function u obtained in this way is a solution of the original second 
order equation it is necessary to show that the constraint equation is satisfied 
everwhere. That Vqu = uq follows directly from the first order system. It also 
follows from the first order system that Vo(wa ~ = 0. Since this is a first 

order homogeneous ODE for Ua — the vanishing of the latter quantity 
for t = 0 implies its vanishing everywhere. 

In the case of the vacuum Einstein equations the bundle V can be taken 
to be the bundle of symmetric covariant second rank tensors. The connection 
can be chosen to be the Levi-Civita connection defined by gajs- This is only 
one possible choice but note that it is important that this connection does 
not depend on the unknown in the equations, in this case the metric ga/3- 

Now a proof of local in time existence for the vacuum Einstein equations 
will be presented. Let hat and Xab be the initial data. In Sect. 2 it was shown 
that the vacuum Einstein equations reduce to a system of nonlinear wave 
equations when harmonic coordinates, or the generalization involving gauge 
source functions, are used. As was already indicated in that section, there is 
no loss of generality in imposing this condition locally in time. If there exists 
a development of particular initial data then there exists a diffeomorphism (j) 
of a neighbourhood of the initial hypersurface such that the pull-back of the 
metric with the given diffeomorphism satisfies the harmonic condition with 
respect to ga0- In fact </> can be chosen to satisfy some additional conditions. 
The harmonic condition is equivalent to a nonlinear wave equation for (j). 
Solving the local in time Cauchy problem for this wave equation provides 
the desired diffeomorphism. The existence theory for this Cauchy problem 
follows from that for symmetric hyperbolic systems by the reduction to first 
order already presented. The initial data for (p will be specified as follows. It 
is the identity on the initial hypersurface as is the contraction of its derivative 
with the normal vector with respect to ga/s- Since the vector d/dt is the unit 
normal vector to S with respect to ga/3 it will also have this property with 
respect to gajs- 

A local solution of the Einstein equations corresponding to prescribed ini- 
tial data can be obtained as follows. Let hab and Xab denote the components 
of the tensors making up the initial data in a local chart as above. A set of 
initial data for the harmonically reduced vacuum Einstein equations consists 
of values for the whole metric gafs and its time derivative on the initial hy- 
persurface. A data set of this kind can be constructed from hab and Xab as 
follows. (The following equations are expressed in local coordinates, but their 
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invariant meaning should be clear.) 

9ab — ^ab: 90a — 0; 900 — 1 

dt9ab = 2Xab, dtgoa = h^‘"{hab,c - {l/2)hbc,a) - habr^d^'^'^: 

dtgoo = - 2 h°-^Xab 

These data are chosen in such a way that the harmonic condition is satisfied 
on the initial hypersurface. 

Corresponding to the initial data for the reduced equations there is a 
unique local solution of these equations. It remains to show that it is actually a 
solution of the Einstein equations provided the initial data hab and Xab satisfy 
the Einstein constraint equations. In order to do this it suffices to show that 
the harmonic conditions are satisfied everywhere since under those conditions 
the reduced equations are equivalent to the Einstein equations. Let Z\“ = 
— r^^). That the harmonic conditions are satisfied can be verified 
using the fact that the quantities Z\“ satisfy a linear homogeneous system of 
wave equations. By uniqueness for this system the vanish provided the 
initial data Z\“ and vanish on the initial hypersurface. The first of these 
was built into the construction of the data for the reduced equations. The 
second is a consequence of the combination of the reduced equations with the 
Einstein constraints. 

5.2 Uniqueness 

The argument of the last section gives an existence proof for solutions of the 
vacuum Einstein equations, local in time. It does not immediately say any- 
thing about uniqueness of the space-time constructed. The solution of the 
reduced equations is unique, as a consequence of the uniqueness theorem for 
solutions of symmetric hyperbolic systems. However the freedom to do diffeo- 
morphisms has not yet been explored. In fact it is straightforward to obtain a 
statement of uniqueness of the solution corresponding to given abstract initial 
data, up to diffeomorphism. Suppose two solutions g\ and 52 with the same 
initial data are given. Choose a reference metric g as before and determine 
diffeomorphisms (pi and p2 such that the identity is a harmonic map with 
respect to the pairs {g,gi) and (5,52) respectively. The transformed metrics 
satisfy the same system of reduced equations with the same initial data and 
thus must coincide on a neighbourhood of the initial hypersurface. Thus gi 
and {p2 ° 4'i^)*g2 coincide on a neighbourhood of the initial hypersurface. 
This statement is often referred to as ‘geometric uniqueness’. 

For hyperbolic equations it is in general hard to prove theorems about 
global existence of solutions due to the possibility of the formation of singu- 
larities. On the other hand, it is possible to prove global uniqueness theorems. 
Proving global uniqueness for the Einstein equations is more difficult due to 
difficulties with controlling the freedom to do diffeomorphisms. In this con- 
text we use the word ’global’ to mean not just applying to a subset of a 
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given solution, but to whole solutions having suitable intrinsic properties. 
The strategy which has just been used to prove local existence using har- 
monic maps cannot be applied directly. For doing so would require a global 
existence theorem for harmonic maps and that we do not have in general. 

A global uniqeness theorem was proved by abstract means by Choquet- 
Bruhat and Geroch [19], who introduced the notion of the maximal Cauchy 
development. If initial data for the Einstein equations coupled to some matter 
fields on a manifold S are prescribed, a development of the data is a solution 
of the Einstein-matter system on a manifold M together with an embedding 
4> oi S into a M which induces the correct initial data and for which the 
image of S' is a Cauchy surface. Another development with a solution of the 
Einstein-matter equations on a manifold M' and an embedding (jj' is called 
an extension of the first if there is a diffeomorphism ijj from M to an open 
subset U of M' which maps the given metric on M onto the restriction of 
the metric on M' to U and also maps the matter fields on M to those on U 
obtained by restriction from M' . If we are dealing with the vacuum Einstein 
equations then the requirement on the matter fields is absent. In [19] the 
following theorem was proved for the vacuum case: 

Theorem Let S be an initial data set. Then there exists a development M of 
S which is an extension of every other development of S. This development 
is unique up to isometry. 

The development whose existence and uniqueness is asserted by this theorem 
is called the maximal Cauchy development of the initial data set. Uniqueness 
up to isometry means the following. If we have two developments of the same 
data given by embeddings </> and 4>' of S into manifolds M and M' respectively 
then there exists a diffeomorphism tjj •. M ^ M' which is an isometry and 
satisfies 4>' = (p o tj;. The proof of this theorem does not depend strongly on 
the vacuum assumption. One potential problem in extending it to certain 
matter fields is gauge freedom in those fields. This requires the concept of 
extension to be defined in a slightly different way. For instance in the case 
of gauge fields it is necessary to consider not only diffeomorphisms of the 
base manifold, but also automorphisms of the principal bundle entering into 
the definition of the theory. We do not expect that this leads to any essential 
difficulty, but in any concrete example one should pay attention to this point. 

The proof of this theorem applies directly to Zorn’s lemma and it is an 
open question whether it is possible to remove the use of the axiom of choice 
from the argument. The maximal Cauchy development is often very useful in 
formulating certain arguments. However it remains very abstract and gives 
the subjective impression of being difficult to pin down. 

It should be emphasized that, despite its global aspects, it would be mis- 
leading to consider the above theorem as a global existence theorem for the 
Einstein equations in any sense. A comparison with ordinary differential equa- 
tions may help to make this clear. If an ordinary differential equation for 
a function u{t) is given (with smooth coefficients) then the standard local 
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existence theorem for ordinary differential equations says that given an ini- 
tial value uq there exists a T > 0 and a unique solution u{t) on the inter- 
val {—T,T) with u(0) = uq. Now one can ask for the longest time interval 
(— Ti,T 2 ) on which a solution of this kind exists for a fixed initial value uq. 
This is called the maximal interval of existence and has a similar status to 
that of the maximal Cauchy development. The fact that the maximal inter- 
val of existence is well-defined says nothing about the question whether the 
solution exists globally or not, which is the question whether Ti and T 2 are 
infinite or not. The existence of the maximal Cauchy development says noth- 
ing about the global properties of the solution obtained in this way, whereas 
global existence of the solution of an ordinary differential equation does mean 
that the solution has a certain global property, namely that it exists for an 
infinitely long time. 

5.3 Cauchy Stability 

Cauchy stability of the initial value problem for the Einstein equations is the 
statement that, in an appropriate sense, the solution of the Einstein equations 
depends continuously on the initial data. This continuity statement has two 
parts, which can be stated intuitively in the following way. Firstly, if a solution 
corresponding to one initial data set is defined on a suitable closed region, 
then the solution corresponding to any initial data set close enough to the 
original one will be defined on the same region. Closeness is defined in terms of 
Sobolev norms. Care is needed with the interpretation of the phrase ‘the same 
region’ due to the diffeomorphism invariance. To make it precise, something 
has to be said about how regions of the different spacetimes involved are to 
be compared with each other. Secondly, the solution defined on this common 
region depends continuously on the initial data, where continuity is again 
defined in terms of Sobolev norms. In non-compact situations it is appropriate 
to use local Sobolev norms for this, i.e. the Sobolev norms of restrictions of 
a function to compact sets. 

Rather than make this precise in general we will restrict to one case where 
the formulation of the statement is relatively simple, but which is still general 
enough to give a good idea of the basic concepts. Consider initial data sets 
for the Einstein equations on a compact manifold S. For definiteness let us 
restrict to the vacuum case. As has been discussed above, solutions can be 
constructed by using the harmonically reduced equations. One step in this 
process is to associate to geometric data (hab,Xab) full data (ffo,/?, For 

each of these pairs let us choose the topology of the Sobolev space H^{S) 
for the first member and that of for the second. Then standard 

properties of Sobolev spaces show that if s is sufficiently large the mapping 
from geometric data to full data is continuous. Suppose now that we have 
one particular solution of the Einstein vacuum equations with data on S. 
The corresponding solution of the harmonically reduced equations exists on 
some region of the form S x [-T, T], If s is sufficiently large then there exists 
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an open neighbourhood of the given data in H^{S) x such that 

for any data in this neighbourhood there exists a corresponding solution in 
H^{S X [-T, T]). Moreover the mapping from data to solutions defined on this 
neighbourhood is continuous (in fact differentiable) . This has been proved by 
Choquet-Bruhat [18]. 

The theorem concerning a compact initial hypersurface can also be mod- 
ified to give a local statement of the following type. Let initial data for the 
Einstein equations be given on some manifold S and suppose that a corre- 
sponding solution is given on a manifold M . There is a neighbourhood U of 
S where harmonic reduction is possible globally. This identifies U with an 
open subset of S' x R. This contains a set of the form V x [-T, T] (for some 
open subset E of S and some T > 0) which contains any given point of the 
initial hypersurface. If we cut off the the initial data for the harmonically 
reduced equations and use the domain of dependence, we can use the above 
statement for a compact initial hypersurface to get continuous dependence on 
initial data for a possibly smaller set V x [—T\ T'j. Summing up, each point 
sufficiently close to the initial hypersurface has a neighbourhood Wi with 
compact closure such that there is an open subset W 2 of the initial hyper- 
surface with compact closure such that the following properties hold. If the 
restriction of an initial data set for the Einstein equations is sufficiently close 
to that of the original data set in H‘‘{W 2 ) then there exists a correspond- 
ing solution of class on a neighbourhood of Wi. Moreover, the resulting 
mapping from to is continuous. 

5.4 Matter Models 

To specify a matter model in general relativity three elements are required. 
The first is a set of tensors (or perhaps other geometrical objects) on space- 
time which describe the matter fields. The second is the equations of motion 
which are to be satisifed by these fields. The third is the expression for the 
energy-momentum tensor in terms of the matter fields which is to be used to 
couple the matter to the Einstein equations. Note that in general both the 
matter field equations and the expression for the energy-momentum tensor 
involve the space-time metric. Thus it is impossible to consider matter in 
isolation from the space-time metric. In solving the Cauchy problem it is 
necessary to deal with the coupled system consisting of the Einstein equations 
and the equations of motion for the matter fields. 

There are two broad classes of matter models which are considered in gen- 
eral relativity, the field theoretical and phenomenological matter models. The 
distinction between these is not sharply defined but is useful in order to struc- 
ture the different models. The intuitive idea is that the field theoretic matter 
models correspond to a fundamental description while the phenomenological 
models represent an effective description of matter which may be useful in 
certain situations. Within the context of classical general relativity, which is 
the context of this article, the pretension of the field theoretic matter models 
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to be more fundamental is not well founded since on a fundamental level the 
quantum mechanical nature of matter should be takeir into account. 

Before goiirg further, a geireral remark on the Einsteiir-matter equations 
is in order. Suppose that iir any given coordinate system the matter equa- 
tions can be written in symmetric hyperbolic form in terms of a variable u. 
Consider the system obtained by coupling the harmonically reduced Einstein 
equations, written in first order symmetric hyperbolic form, to the sytem 
for u. If the coupling is only by terms of order zero then the combined sys- 
tem is symmetric hyperbolic and a local existence theorem for the reduced 
Einstein-matter system is obtained. The condition for this to happen is that 
the equations for the matter fields coirtaiir at most first derivatives of the 
metric (iir practice the Christoffel symbols) and that the eirergy-momentum 
tensor contains no derivatives of u. Wheir these conditions are satisfied, local 
existence for the Einstein-matter equations (irot just the reduced equations) 
cair be proved usiirg the same strategy as we preseirted iir the vacuum case. 
The fact, which should hold for any physically reasonable matter model, that 
the energy-momentum tensor is divergence free as a consequence of the mat- 
ter field equations, can be is used derive the equation which allows it to be 
proved that the harmonic condition propagates. 

It would be unreasonable to try and describe here all the matter models 
which have ever been used in general relativity. We will, however, attempt to 
give a sufficiently wide variety of examples to illustrate most of the important 
features to be expected in general. We start with the field theoretic models. 

The simplest case is where the matter field is a single real- valued function 
(j). The equations of motion are: 

Here m is a constant and E is a smooth function which is 0{4>^) for (j) close to 
zero. A typical example would be V{(p) = The energy-momentum tensor 
is: 

- [(1/2)(V^0V^</)) + + 2V{4>)]g^p 

The equation for is a nonlinear wave equation and so may be reduced 
to a symmetric hyperbolic system. When it is coupled to the harmonically 
reduced Einstein equations via the energy-momentum tensor above and the 
whole system reduced to first order there is no coupling in the principal part. 
As mentioned above this is enough to allow a local existence theorem to 
be proved. Note that the splittiirg off of the nonlinear term V aird the sign 
coirditioir following from the form of the coefficient in the liirear term 
are irrelevant for the local well-posedness of the equations. On the one haird 
they are motivated by considerations of the physical interpretation of the 
equations. Oir the other haird they have an important influence on the global 
behaviour of solutions. This matter model is often referred to as ‘the scalar 
field’ , although when used without qualification this often means the special 
case m = 0, V = 0. 
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The scalar field can be thought of as a mapping into the real line. It 
can be generalized by considering mappings into a Riemannian manifold N. 
An equivalent of the massless scalar field with vanishing potential is the 
nonlinear cr-model or wave map as it is often known to physicists and math- 
ematicians respectively. It is a mapping from space-time into a manifold N 
with Riemannian metric h called the target manifold. The field equations 
and energy-momentum tensor have a coordinate-invariant meaning but we 
will content ourselves with giving the expressions in coodinate systems on M 
and N. The field equations are 

where are the Christoffel symbols of h in some coordinate system. The 
energy-momentum tensor is: 

The special case where N is the complex plane with the flat Euclidean metric 
corresponds to the complex scalar field. In contrast to the case of the scalar 
field, the wave map does not allow the addition of a mass term or a potential 
term in any obvious way. If N has the structure of a vector space there is an 
obvious way of defining a mass term and further structure on N may lead 
to natural ways of defining a potential. These features occur in the case of 
Higgs fields. It may be noted that the wave maps considered here, which are 
sometimes also called hyperbolic harmonic maps, are related mathematically 
to the harmonic gauge discussed in Sect. 4. The role of the connection F in 
Sect. 4 is played here by the Levi-Civita connection of the target manifold. 

One of the most familiar matter models in general relativity is the Maxwell 
field. This is described by an antisymmetric tensor Fap. The equations of mo- 
tion for the source-free Maxwell field are \7aF°‘^ = 0 and S/aFf}^ + V/jTlyo, -I- 
V.yT'a /3 = 0 and the energy-momentum tensor is 

- {\/A)F^^F^sg^p 

The second set of Maxwell equations can be solved locally by writing F^p = 
V ccAp — V pAa for a potential Aa- Then the other equations can be regarded 
as second order equations for Aa- Note, however, that if space-time has a 
non-trivial topology then it may be impossible to find a global potential 
which reproduces a given field F^p. In the same way that the scalar field 
can be generalized to get wave maps, the Maxwell field can be generalized to 
get Yang-Mills fields. We will not give the global description of these fields 
involving principal fibre bundles but only give expressions in local coordinates 
and a local gauge. The model is defined by the choice of a Lie algebra (which 
we describe via a basis) and a positive definite quadratic form on the Lie 
algebra with components in this basis. Let be the structure constants 
in this basis. The basic matter field is a one-form with values in the Lie 
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algebra and the field strength is defined by 

^i/3 = ^ ^ pAa + Cjj^A'IA^ 

The field equations are 






In the special case where the Lie algebra is one-dimensional (and hence 
Abelian) the Yang-Mills equations reduce to the Maxwell equations. Note 
however that the Yang-Mills field cannot be described by the field strength 
alone. The description in terms of a potential is indispensible. 

A complication which arises when studying the initial value problem for 
the Yang-Mills or Einstein-Yang-Mills systems is that of gauge invariance. 
Although the potential is required it is not uniquely determined. Gauge trans- 
formations of the form: 



leave invariant. Here 5 is a function taking values in a Lie group with 
the given Lie algebra and the expression g~^'S/ g can naturally be identified 
with a Lie-algebra- valued one-form. Fields related by a gauge transformation 
describe the same physical system. The ambiguity here is similar to that of 
the ambiguity of different coordinate systems in the case of the Einstein equa- 
tions. It can be solved in an analogous way by the use of the Lorentz gauge. 
This is similar to the harmonic coordinate condition and reduces the Yang- 
Mills equations on any background to a system of nonlinear wave equations 
which can, if desired, be reduced to a symmetric hyperbolic system. Combin- 
ing harmonic coordinates and Lorentz gauge produces a reduced Einstein- 
Yang-Mills system which can be handled by the same sort of techniques as 
the reduced vacuum Einstein equations. Of course there are a number of 
steps which have to be checked, such as the propagation of Lorentz gauge. 
All these comments apply equally well to the Einstein-Yang-Mills-Higgs sys- 
tem obtained by coupling the Yang-Mills field to a Higgs field. (Now gauge 
transformations for the Higgs field must also be specified.) 

A field theoretic matter model whose Cauchy problem does not fit easily 
into the above framework is that given by the Dirac equation. It has been 
discussed in Sect. 4.4. 

Probably the best known phenomenological matter model in general rel- 
ativity is the perfect fluid. This has already been discussed at some length in 
Sect. 4.2. Here we mention some complementary aspects. Recall that the ba- 
sic matter fields are the energy density p, a non-negative real-valued function, 
and the four- velocity C/“, a unit timelike vector field. The energy-momentum 
tensor is given by 



= {p + p)U°‘U^ + pg°‘^ 
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in the signature used here, namely (—,+,+,+). The pressure p is given in the 
isentropic case in terms of an equation of state p = h{p). The usual assump- 
tions on this equation of state is that it is a non-negative continuous function 
defined on an interval [pojOo) which is positive for p > 0. Also it should be 
for p > 0 with positive derivative h'(p) = dp/dp there. The equations of 
motion of the fluid, the Euler equations, are given by the condition that the 
energy-momentum tensor should be divergence-free. The Euler equations can 
be written as a well-posed symmetric hyperbolic system in terms of the basic 
variables provided we restrict to cases where p > C > 0 for some C > 0. 
Here the condition h' > 0 is crucial. In the study of spatially homogeneous 
cosmological models the equation of state p = kp with fc < 0 is sometimes 
considered. We emphasize that a fluid with an equation of state of this kind 
cannot be expected to have a well-posed initial value problem. This has to do 
with the fact that the speed of sound, which is the square root of h! , is imag- 
inary in that case. In Sect. 5.5 we prove a related but simpler result, namely 
that in special relativity the Euler equations with this equation of state, lin- 
earized about a constant state, have an ill-posed initial value problem. The 
system obtained by coupling the Euler equations to the harmonically reduced 
Einstein equations can be written as a symmetric hyperbolic system in the 
case that the Euler equations can be written symmetric hyperbolic in terms 
of the basic variables, as stated above. 

In the case where the density is everywhere positive, writing the Euler 
equations in symmetric hyperbolic form is not trivial. One approach is to 
take p and the spatial components C/® of the velocity as variables and to 
express in terms of the C/® and the metric via the normalization condition 
U°‘Ua = —1 (see [87]). Another possibility (see [15]) is to consider the Euler 
equations as evolution equations for p and [/“ and treat the normalization 
condition as a constraint, whose propagation must be demonstrated. The 
treatments above are limited to the isentropic Euler equations. For a fluid 
which is not isentropic the conservation equation for the energy-momentum 
tensor must be supplemented by the equation of conservation of entropy 
VqS = 0. The equation of state can then be written in the form p = h(p, s) 
or, equivalently, in the form p = f{n,s), where n is the number density of 
particles. We are not aware that a local existence theorem non-isentropic 
Euler equations has been proved by a generalization of the method for the 
isentropic case just outlined, although there is no reason to suppose that 
it cannot be done, provided the condition dh/dp > 0 is satisfied. As we 
saw in Sect. 4.2, the Einstein equations coupled to the non-isentropic Euler 
equations can be brought into symmetric hyperbolic form by introducing 
additional variables in a suitable way, following [39]. The Cauchy problem 
for the general (i.e. not necessarily isentropic) Euler equations coupled to the 
Einstein equations had much earlier been solved by other means by Choquet- 
Bruhat using the theory of Leray hyperbolic systems [17]. 
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If it is desired to show the existence of solutions of the Einstein-Euler 
system representing dynamical fluid bodies (such as oscillating stars) then 
problems arise. Either the density must become zero somewhere, in which 
case the Euler equations as written in the usual variables fail to be symmetric 
hyperbolic there, or at least the density must come arbitrarily close to zero 
at infinity, which is almost as bad. Treating this situation as a pure initial 
value problem is only possible under restrictive circumstances [76] and the 
general problem is still open. It would seem more promising to try to use the 
theory of initial boundary value problems, explicitly taking account of the 
boundary of the fluid. So far this has only been achieved in the spherically 
symmetric case with po > 0 [61]. 

One case of a fluid which is frequently considered in general relativity is 
dust. This is defined by the condition that the presure should be identically 
zero. Since in that case h' = 0 the straightforward method of writing the fluid 
equations as a symmetric hyperbolic system does not work. The symmetric 
hyperbolic system of [39] discussed in Sect. 4.2 also covers the dust case as 
does the existence theorem of Choquet-Bruhat [17] using a Leray hyperbolic 
system. 

The next phenomenological matter model we will consider comes from 
kinetic theory. It does not quite fit into the framework we have used so far to 
describe matter models since the fundamental matter field is a non-negative 
function / on the cotangent bundle of space-time. (Often the case of particles 
with a fixed mass is considered in which case it is defined on the subset of 
the cotangent bundle defined by the condition g°^^papp = —1, known as 
the mass shell.) The idea is that the matter consists of particles which are 
described statistically with respect to their position and momentum. The 
function / represents the density of particles. The geodesic flow of the space- 
time metric defines a vector field (Liouville vector field) on the cotangent 
bundle. Call it L. This flow describes the evolution of individual test particles. 
The equation of motion for the particles is Lf = Q{f) where Q{f) is an 
integral expression which is quadratic in its argument. This is the Boltzmann 
equation. It describes collisions between the particles in a statistical way. The 
case Q = 0 is the collisionless case, where the equation Lf = 0 obtained is 
often called the Vlasov equation. The energy-momentum tensor is defined by: 



where dw{p) represents a natural measure on the cotangent space or mass 
shell, depending on the case being considered. 

The coupled Einstein-Boltzmann system in harmonic coodinates cannot 
be a hyperbolic system is any usual sense for the simple reason that it is not 
even a system of differential equations, due to the integrals occurring. Never- 
theless, the techniques used to prove existence and uniqueness for hyperbolic 
equations can be adapted to prove local existence for the Einstein-Boltzmann 
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system [7] . In contrast to the case of the perfect fluid nothing particular hap- 
pens when the energy density vanishes so that there is no problem in de- 
scribing isolated concentrations of matter. The Einstein- Vlasov system can 
be used to describe globular clusters and galaxies. For the analogue of this 
in Newtonian theory see [131. Literature on the relativistic case can be found 
in [85]. 

Another kind of phenomenological matter model is elasticity theory. Apart 
from its abstract interest, self-gravitating relativistic elasticity is of interest 
for describing the solid crust of neutron stars. As in the case of a perfect fluid 
the held equations are equivalent to the equation that the divergence of the 
energy-momentum tensor is zero. What is different is the nature of the matter 
variables and the way they enter into the definition of the energy-momentum 
tensor. This is complicated and will not be treated here. The Cauchy problem 
for the Einstein equations coupled to elasticity theory has been discussed by 
Choquet-Bruhat and Lamoureux-Brousse [21]. A local existence theorem for 
the equations of relativistic elasticity has been proved by Pichon [73] for data 
belonging to Gevrey classes. These are classes of functions which are more 
special that C°° functions in that the growth of their Taylor coefficients is 
limited. However they do not have the property of analytic functions, that 
fixing the function on a small open set determines it everywhere. 

In non-relativistic physics, the Euler equations are an approximation to 
the Navier-Stokes equations where viscosity and heat conduction are ne- 
glected. The Navier-Stokes equations are dissipative with no limit to the 
speed at which effects can propagate. Mathematically this has the effect that 
the equations are parabolic with no finite domain of dependence. It is prob- 
lematic to find an analogue of the Navier-Stokes equations in general rel- 
ativity which takes account of the effects of diffusion and heat conduction. 
One possibility is to start from the Boltzmann equation, which does have a 
finite domain of dependence and try to do an expansion in the limit where 
the collision term is large. This is analogous to the Hilbert and Chapman- 
Enskog expansions in non-relativistic physics. The first attempts to do this led 
to equations which probably have no well-posed Cauchy problem (Landau- 
Lifschitz and Eckart models). Hiscock and Lindblom [53] have shown that 
the linearization of these equations about an equilibrium state have solutions 
which grow at arbitrarily large exponential rates. In response to this other 
classes of models were developed where the fluid equations are symmetric 
hyperbolic. (For information on this see [46]). These models do have a well- 
posed Cauchy problem and the main difficulty seems to be to decide between 
the many possible models. Since the variables used to formulate the sym- 
metric hyperbolic system for the fluid are not differentiated in forming the 
energy-momentum tensor, the system obtained by coupling these fluids to 
the Einstein equations can be written in symmetric hyperbolic form. 

More general matter models can be obtained from those already men- 
tioned by combining different types of matter field. For instance there is the 
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charged scalar field which combines a scalar field and a Maxwell field. This 
often produces no extra difficulties at all for the local in time Cauchy problem 
due to the fact, mentioned above, that the system obtained by taking two 
symmetric hyperbolic systems together is also symmetric hyperbolic, pro- 
vided the coupling between the two systems is via terms of order zero. It 
is also routine to allow charged particles in a kinetic model, obtaining the 
Einstein-Maxwell-Boltzmann system [7]. Charged fluids are more compli- 
cated. The local in time Cauchy problem has been treated in two cases, those 
of zero conductivity and infinite conductivity. The latter model is also known 
as magnetohydrodynamics. It has been shown to be have a well-posed initial 
value problem only for data in Gevrey classes [67] , [68] . 

5.5 An Example of an Ill-Posed Initial Value Problem 

It may be hard to appreciate the significance of a system of equations having 
a well-posed initial value problem since most examples which come up in 
practice do have this property. In this section we present an ill-posed example 
which is close to examples which relativists are familiar with. Consider the 
special relativistic Euler equations with equation of state p = kp where k < 0. 
A special solution is given by constant density and zero spatial velocity. Now 
consider the equations obtained by linearizing the Euler equations about this 
background solution. The unknowns in the linearized system will be denoted 
by adding a tilde to the corresponding unknowns in the nonlinear system. 
For simplicity we take the background density to be unity. The linearized 
equations are: 



dtP 

dtu^ 



-2(1 -f k)dau‘^ 



It will be shown that given any T > 0 there exist periodic initial data (po, ug) 
of class C°° such that it is not true that there is a unique corresponding 
solution, periodic in the space coordinates, on the time interval [0,T]. The 
condition of periodicity here does not play an essential role. It is adopted for 
convenience. Instead of thinking of smooth periodic functions on with can 
equally well think in terms of smooth functions on a torus T^. The space of 
smooth functions on can be made into a topological vector space X in a 
standard way. Convergence of functions in the sense of this topology means 
uniform convergence of the functions and their derivatives of all orders. In a 
similar way the space of smooth functions on [0, T] x can be made into 
a topological vector space Y . These are Frechet spaces [81]. The unknown in 
the linearized Euler equations can be thought of as an element of X'^ and 
the data as an element of Y^. Let Z be the closed linear subspace of X^ 
consisting of solutions of the linearized Euler equations. Consider the linear 
mapping L : Z ^ Y^ defined by restricting solutions to t = 0. It is continuous 
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with respect to the relevant topologies. If existence and uniqueness held for 
all initial data then this linear map would be invertible. Hence, by the open 
mapping theorem [81], it would have continuous inverse. What this means 
concretely is that given any sequence {po,n, Wq n) which is uniformly bounded 
together with each of its derivatives, the corresponding sequence of solutions 
(which exists by assumption) must also be bounded together with each of each 
of its derivatives. Thus to prove the desired theorem it is enough to exhibit 
a uniformly bounded sequence of initial data and a corresponding sequence 
of solutions which is not uniformly bounded. This can be done explicitly as 
follows: 



P0,n 

Pn 



From these explicit formulae we get an idea what is going wrong. Fourier 
modes of increasing frequencies of the initial data grow at increasing expo- 
nential rates. In the case of a fluid where the equation of state has a positive 
value of k the hyperbolic functions in the above formulae are replaced by 
trigonometric ones and the problem does not arise. 

In fact in the above example the density perturbation satisfies a second 
order equation which is elliptic. After a rescaling of the time coordinate it 
reduces to the Laplace equation. The computation which has just been done 
should be compared with the remarks on the Cauchy problem for the Laplace 
equation on p. 229 of [27], Vol. 2. The corresponding example for the Laplace 
equation goes back to Hadamard [49]. 

Solutions of the Einstein equations coupled to a fluid with an equation of 
state of the type considered in this section have been considered in the context 
of inflationary models [10], [93]. While this is unproblematic for spatially 
homogeneous models, the above ill-posedness result suggests strongly that 
this kind of model cannot give reasonable results in the inhomogeneous case. 

Some general comments on well-posedness and stability will now be made. 
Suppose a solution of a system of evolution equations is given. Assume for 
simplicity that this solution is time-independent, although a similar discus- 
sion could be carried out more generally. The solution is called stable if in 
order to ensure that a solution stays close to the original solution to any 
desired accuracy, it is enough to require it to be sufficiently close at one time. 
Closeness is measured in some appropriate norm. Well-posedness has no in- 
fluence on stability in this sense. Already for ordinary differential equations 
with smooth coefficients, which always have a well-posed initial value prob- 
lem, stability does not in general hold. Solutions of the linearized sytem about 
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the given solution can grow exponentially. However there is a constant k such 
that no linearized solution can grow faster than Ce^‘. This is a rather general 
feature of well-posed evolution equations. For instance any linear symmetric 
hyperbolic system allows an exponential bound with some constant k inde- 
pendent of the solution. If no such bound is possible, then it is said that there 
is a violent instability. (Cf. [69], Sect. 4.4 for this terminology.) The presence 
of a violent instability is closely related to ill-posedness, as can be seen in the 
above example. 

5.6 Symmetries 

If an initial data set for the Einstein-matter equations possesses symmetries, 
then we can expect these to be inherited by the corresponding solutions. This 
will be discussed here in the case of the vacuum Einstein equations. There is 
nothing in the argument which obviously makes essential use of the vacuum 
condition and it should extend to reasonable types of matter. It makes use 
of the maximal Cauchy development and so any restrictions on the matter 
model which might come up there would appear again in the present context. 

The following only covers symmetries of spacetime which leave a given 
Cauchy surface invariant. It is based on group actions rather than Killing 
vectors. A more extensive discussion of symmetries of spacetime and their 
relations to the Cauchy problem can be found in Sect. 2.1 of [26]. 

By a symmetry of an initial data set (5, hat, Xab) for the vacuum Einstein 
equations we mean a diffeomorphism ip : S ^ S which leaves hab and Xab 
invariant. Let (p be the embedding of S into its maximal Cauchy development. 
Then (p = (po ip also satisfies the properties of the embedding in the definition 
of the maximal Cauchy devlopment. Hence, by uniqueness up to isometry, 
there exists an isometry ip of the maximal Cauchy development onto itself 
such that Ip o (p = (p. This means that ip o <p = (po ip. Thus ip is an isometry 
of M whose restriction to <p{S) is equal to ip. We see that a symmetry of 
the initial data extends to a symmetry of the solution. Next we wish to 
show that this extension is unique. Since a general theorem of Lorentzian 
(or Riemannian) geometry says that two isometries which agree on a open 
set agree everywhere it suffices to show that any two isometries ip with the 
properties described agree on a neighbourhood of (p{S). Let p be a point of 
4>{S). A neighbourhood of p can be covered with Gauss coordinates based 
on (p{S). An isometry preserves geodesics and orthogonality. Hence if, when 
expressed in Gauss coordinates, it is the identity for t = 0 it must be the 
identity everywhere. This completes the proof of the uniqueness of ip. 

Now consider the situation where a Lie group G acts on S in such a 
way that each transformation ipg of S corresponding to an element of the 
group is a symmetry of the initial data. Let H be the isometry group of 
the maximal Cauchy development and Hs the group of all isometries of the 
maximal Cauchy development which leave (p{S) invariant. The group Hs is 
a closed subgroup of the Lie group H and thus is itself a Lie group. Each 
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ijjg is the restriction of a unique element i[>g of Hs- Using uniqueness again 
we must have ij^gh = ipgiph for all elements g and h of G. Thus we obtain a 
homomorphism from G to Hs- This shows that there exists an action of the 
group G on M by isometries which extends the action on arising from 
the original action on S by means of the identification using (j). However this 
argument does not show that the resulting action of G is smooth. To show 
this consider first the group Hi of all symmetries of the initial data. It is a 
closed subgroup of the isometry group of h and therefore has the structure 
of a Lie group. The above considerations show that restriction defines an 
isomorphism of groups from Hs to Hj. (We identify S with (f>{S) here.) If 
we knew that this mapping was continuous a general theorem on Lie groups 
[92] would show that it is also an isomorphism of Lie groups. The continuity 
can be seen by noting that the topology of an isometry group coming from 
its Lie group structure coincides with the compact open topology [62]. The 
continuity of the restriction mapping in the compact open topology follows 
immediately from the definitions. We conclude that as Lie groups Hs and 
Hi can be identified. 

Now we come back to the action of G. The action of G on initial data 
is a smooth mapping G x S' — >■ S'. It is the composition of a smooth homo- 
morphism from G to Hi with the action of Hi on S. By the comments of 
the last paragraph this can be identified with the composition of a smooth 
homomorphism from G to Hs with the action of Hs- In this way we obtain 
a smooth action of G on M which leaves S invariant and restricts to the 
original action on the initial data. It is the action of G which we previously 
considered. 



6 Outlook 

This article is intended to be an informative tour through its subject, rather 
than an exhaustive account. The latter would in any case be impossible in 
an article of this length, given the amount of literature which now exists. 
The aim of this section is to mention a few of the important things which 
have been left out, and to direct the reader to useful sources of information 
concerning these. A good starting point is the review article of Choquet- 
Bruhat and York [23] which is still very useful. (See also [34].) There is an 
extensive treatment of the constraints in reference [23]. For a selection of 
newer results on the constraints, see e.g. [5], [6], [57], [58], [59] and [20]. 

The most obvious omission of the present article is the lack of statements 
on the global Cauchy problem. A review with pointers to further sources can 
be found in [78]. This material is too recent to be discussed in [23] and a 
lot has happened since then. It is natural that once some of the basic local 
questions had been solved attention turned to global issues. The latter are 
now central to present research on the Cauchy problem. Most of the existing 
results concern spacetimes with high symmetry, although in the meantime 
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there are also a few theorems on space-times without symmetries which are 
small but finite perturbations of space-times of special types. This is an area 
which is developing vigorously at the moment. Often the statements obtained 
about global existence of solutions are accompanied by information on the 
global qualitative properties of the solutions. A discussion of the asymptotic 
behaviour of a particular class of solutions and its relevance for the modelling 
of certain physical systems can be found in [40] and [41] . 

In the introduction we mentioned the possible applications of ideas con- 
nected with the Cauchy problem to analytical and numerical approximations. 
Up to now progress on establishing an effective interaction between theoret- 
ical developments and the applications of approximate methods to concrete 
physical problems such as the generation of gravitational waves has been lim- 
ited. For instance, little has been done on the question of proving theorems 
on analytical approximations in general relativity since [28], [75], and [77]. 
There is no shortage of things to be done. For instance one tempting goal 
would be a precise formulation and justification of the quadrupole formula. 

As for the link to numerical relativity, the discussions in Sect. 4, apart 
from their interest for purely analytical reasons, could potentially be ex- 
ploited for improving numerical codes. Many new hyperbolic reductions have 
been suggested recently with the aim of providing equations which would 
ensure a stable time evolution. We have added a few more. We also pointed 
out various different gauge conditions. Their usefulness for stable long-time 
numerical calculations still has to be explored. We have seen that different 
representations of the field equations and different formalisms allow us to em- 
ploy different gauge conditions. The possibilities of controlling the lifetime of 
a gauge by a judicious choice of gauge source functions have neither been 
investigated analytically nor numerically in a systematic way. 

One of the main interests in the analysis of the initial boundary problem 
lies in the fact that many approaches to numerical relativity require the in- 
troduction of timelike boundary hypersurfaces which reduce the calculations 
to spatially finite grids. A good analytical understanding of the initial bound- 
ary value problem does not guarantee the stability of long-time evolutions for 
problems with timelike boundaries, but the latter are likely to fail without 
a proper understanding of the analytical background. In this respect we also 
consider the analytical investigations of the conformal Einstein equations (cf. 
[40] , [41] ) as an opportunity for numerical relativity. They allow us in princi- 
ple to calculate entire spacetimes, including their asymptotic behaviour, on 
finite grids without the need to introduce artificial boundaries in the physical 
spacetime. 

Another place where the theory of the Cauchy problem could have some- 
thing to contribute is the application of numerical methods developed for 
systems of conservation laws (or, more generally, systems of balance laws) 
to the Einstein equations, a procedure which has been popular recently. In a 
certain sense this means accepting an analogy between the Einstein equations 
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and the Euler equations. At the moment this amounts to no more than the 
fact that both systems are quasi-linear hyperbolic and can be formulated as 
systems of balance laws. Unlike the Euler equations, the Einstein equations 
do not appear to admit a preferred formulation of this kind. There are differ- 
ent alternatives and no known criterion for choosing between them. It is very 
tempting to import the vast amount of knowledge which has been accumu- 
lated concerning the numerical solution of the Euler equations into general 
relativity. On the other hand it is not at all clear how many of these tech- 
niques are really advantageous for, say, the vacuum Einstein equations. One 
might hope that analytical theory could throw some light on these questions. 

There can be little doubt that increased cooperation between people work- 
ing on analytical and numerical aspects of the evolution of solutions of the 
Einstein equations would lead to many new insights. As both fields progress 
cases where a rewarding collaboration would be possible are bound to present 
themselves. It suffices for someone to show the initiative required to profit 
from this situation. 

An open problem which has been touched on already is that of the exis- 
tence of solutions of the Einstein-Euler system describing fluid bodies. This 
kind of free boundary problem is poorly understood even in classical physics, 
although there has been significant progress recently [95] . The central impor- 
tance of this becomes clear when it is borne in mind that the usual applica- 
tions of gravitational theory in astrophysics (except for cosmology) concern 
self-gravitating bodies. 

We hope to have succeeded in showing in this article that the study of the 
Cauchy problem for the Einstein equations is a field which presents a variety 
of fascinating challenges. Perhaps, with luck, it will stimulate others to help 
tackle them. 
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1 Introduction 

1.1 On Approximation Methods in General Relativity 

Let us declare that the most important devoir of any physical theory is to 
draw firm predictions for the outcome of laboratory experiments and astro- 
nomical observations. Unfortunately, the devoir is quite difficult to fulfill in 
the case of general relativity, essentially because of the complexity of the 
Einstein field equations, to which only few exact solutions are known. For 
instance, it is impossible to settle the exact prediction of this theory when 
there are no symmetry in the problem (as is the case in the problem of the 
gravitational dynamics of separated bodies). Therefore, one is often obliged, 
in general relativity, to resort to approximation methods. 

It is beyond question that approximation methods do work in general rel- 
ativity. Some of the great successes of this theory were in fact obtained using 
approximation methods. We have particularly in mind the test by Taylor 
and collaborators [1-3] regarding the orbital decay of the binary pulsar PSR 
1913-1-16, which is in agreement to within 0.35% with the general-relativistic 
post- Newtonian prediction. However, a generic problem with approximation 
methods (especially in general relativity) is that it is non trivial to define a 
clear framework within which the approximation method is mathematically 
well-defined, and such that the results of successive approximations could 
be considered as theorems following some precise (physical and/or technical) 
assumptions. Even more difficult is the problem of the relation between the 
approximation method and the exact theory. In this context one can ask: 
What is the mathematical nature of the approximation series (convergent, 
asymptotic, . . .)? What its “reliability” is (i.e., does the approximation series 
come from the Taylor expansion of a family of exact solutions)? Does the 
approximate solution satisfy some “exact” boundary conditions (for instance 
the no-incoming radiation condition)? 

Since the problem of theoretical prediction in general relativity is com- 
plex, let us distinguish several approaches (and ways of thinking) to it, and 
illustrate them with the example of the prediction for the binary pulsar. First 
we may consider what could be called the “physical” approach, in which one 
analyses the relative importance of each physical phenomena at work by using 
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crude numerical estimates, and where one uses only the lowest-order approx- 
imation, relating if necessary the local physical quantities to observables by 
means of balance equations (perhaps not well defined in terms of basic theo- 
retical concepts) . The physical approach to the problem of the binary pulsar 
is well illustrated by Thorne in his beautiful Les Houches review [4] (see also 
the round table discussion moderated by Ashtekar [5]): one derives the loss of 
energy by gravitational radiation from the (Newtonian) quadrupole formula 
applied formally to point-particles, assumed to be test-masses though they 
are really self-gravitating, and one argues “physically” that the effect comes 
from the variation of the Newtonian binding energy in the center-of-mass 
frame - indeed, on physical grounds, what else could this be (since we expect 
the rest masses won’t vary)? The physical approach yields the correct result 
for the rate of decrease of the period of the binary pulsar. Of course, think- 
ing physically is extremely useful, and indispensable in a preliminary stage, 
but certainly it should be completed by a solid study of the connection to 
the mathematical structure of the theory. Such a study would a posteriori 
demote the physical approach to the status of “heuristic” approach. On the 
other hand, the physical approach may fall short in some situations requiring 
a sophisticated mathematical modelling (like in the problem of the dynam- 
ics of singularities), where one is often obliged to follow one’s mathematical 
rather than physical insight. 

A second approach, that we shall qualify as “rigorous”, has been advo- 
cated mainly by Jurgen Ehlers (see, e.g., [6]). It consists of looking for a high 
level of mathematical rigor, within the exact theory if possible, and otherwise 
using an approximation scheme that we shall be able to relate to the exact 
theory. This does not mean that we will be so much wrapped up by math- 
ematical rigor as to forget about physics. Simply, in the rigorous approach, 
the prediction for the outcome of an experiment should follow mathematically 
from first theoretical principles. Clearly this approach is the one we should 
ideally adhere to. As an example, within the rigorous approach, one was not 
permitted, by the end of the seventies, to apply the standard quadrupole 
formula to the binary pulsar. Indeed, as pointed out by Ehlers et al [7], it 
was not clear that gravitational radiation reaction on a self-gravitating sys- 
tem implies the standard quadrupole formula for the energy flux, notably 
because computing the radiation reaction demands a priori three non-linear 
iterations of the field equations [8], which were not fully available at that 
time. Ehlers and collaborators [7] remarked also that the exact results con- 
cerning the structure of the field at infinity (nota bly the asymptotic shear 
of null geodesics whose variation determines the flux of radiation) were not 
connected to the actual dynamics of the binary. 

Maybe the most notable result of the rigorous approach concerns the 
relation between the exact theory and the approximation methods. In the 
case of the post-Newtonian approximation (limit c — >■ oo), Jurgen Ehlers has 
provided with his frame theory [9-11] a conceptual framework in which the 
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post-Newtonian approximation can be clearly formulated (among other pur- 
poses) . This theory unifies the theories of Newton and Einstein into a single 
generally covariant theory, with a parameter 1/c taking the value zero in the 
case of Newton and being the inverse of the speed of light in the case of 
Einstein. Within the frame theory not only does one understand the limit re- 
lation of Einstein’s theory to Newton’s, but one explains why it is legitimate 
when describing the predictions of general relativity to use the common-sense 
language of Newton (for instance thinking that the trajectories of particles 
in an appropriately defined coordinate system take place in some Euclidean 
space, and viewing the coordinate velocities as being defined with respect to 
absolute time). It was shown by Lottermoser [12] that the constraint equa- 
tions of the (Hamiltonian formulation of the) Ehlers frame theory admit 
solutions with a well-defined post-Newtonian limit. Further in the spirit of 
the rigorous approach, we quote the work of Rendall [13] on the definition 
of the post-Newtonian approximation, and the link to the post-Newtonian 
equations used in practical computations. (See also [14,15] for an attempt 
at showing, using restrictive assumptions, that the post-Newtonian series is 
asymptotic.) 

The important remarks of Jurgen Ehlers et al [7] on the applicability of 
the quadrupole formula to the binary pulsar stimulated research to settle 
down this question with (al least) acceptable mathematical rigor. The ques- 
tion was finally answered positively by Damour and collaborators [16-19], 
who obtained in algebraically closed form the general-relativistic equations 
of motion of two compact objects, up to the requisite 5/2 post-Newtonian 
order (2.5PN order or 1/c®) where the gravitational radiation reaction force 
appears. This extended to 2.5PN order the work at IPN of Lorentz and 
Droste [20], and Einstein, Infeld and Hoffmann [21]. The net result is that 
the dynamics of the binary pulsar as predicted by (post-Newtonian) general 
relativity is in full agreement both with the prediction of the quadrupole 
formula, as derived earlier within the “physical” approach, and with the ob- 
servations by Taylor et al (see [22] for discussion). 

Motivated by the success of the theoretical prediction in the case of the 
binary pulsar [16-19,22], we shall try to follow in this article the spirit of the 
“rigorous” approach of Jurgen Ehlers, notably in the way it emphasizes the 
mathematical proof, but we shall also differ from it by a systematic use of 
approximation methods. This slightly different approach recognizes from the 
start that in certain difficult problems, it is impossible to derive a physical 
result all the way through the exact theory without any gap, so that one must 
proceed with approximations. But, in this approach, one implements a math- 
ematically well-defined framework for the approximation method, and within 
this framework one proves theorems that (ideally) guarantee the correctness 
of the theoretical prediction to be compared with experiments. Because the 
comparison with experiments is the only thing which matters in fine for a 
pragmatist, we qualify this third approach as “pragmatic”. 
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In this article we describe the pragmatic approach to the problem of 
gravitational radiation emitted by a general isolated source, based on the 
rigorous post-Minkowskian iteration of the field outside the source [23], and 
on the general connection of the exterior field to the field inside a slowly- 
moving source [24,25]. Note that for this particular problem the pragmatic 
approach is akin to the rigorous one in that it permits to establish some 
results on the connection between approximate and exact methods. For in- 
stance it was proved by Damour and Schmidt [26] (see also [27,28]) that the 
post-Minkowskian algorithm generates an asymptotic approximation to exact 
solutions, and it was shown [29] that the solution satisfies to any order in the 
post-Minkowskian expansion a rigorous definition of asymptotic flatness at 
future null infinity. However it remains a challenge to analyse in the manner 
of the rigorous approach the relation to exact theory of the whole formalism 
of [23-25,29]. 

By combining the latter post-Minkowskian approximation and a post- 
Newtonian expansion inside the system, it was proved (within this frame- 
work of approximations) that the quadrupole formula for slowly- moving, 
weakly-stressed and self-gravitating systems is correct, even including post- 
Newtonian corrections [30] ; and idem for the radiation reaction forces acting 
locally inside the system, and for the associated balance equations [31,32]. 
These results answered positively Ehlers’ remarks [7] in the case of slowly- 
moving extended (fluid) systems. However we are also interested in this article 
to the application to binary systems of compact objects modelled by point- 
masses. Indeed the latter sources of radiation are likely to be detected by 
future gravitational-wave experiments, and thus concern the pragmatist. We 
shall see how one can address the problem in this case. (When specialized to 
point-mass binaries, the results on radiation reaction [31,32] are in agreement 
with sepa rate work of Iyer and Will [33,34].) For other articles on the prob- 
lem of gravitational radiation from general and binary point-mass sources, 
see [35-39]. 

1.2 Field Equations and the No-Incoming-Radiation Condition 

The problem is to find the solutions, in the form of analytic approximations, 
of the Einstein field equations in 

R^'' - \g^’^R = , ( 1 ) 

and thus also of their consequence, the equations of motion of the matter 
source, = 0. Throughout this work we assume the existence and 

unicity of a global harmonic (or de Bonder) coordinate system. This means 
that we can choose the gauge condition 



= 0 ; 






( 2 ) 
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where g and denote the determinant and inverse of the covariant metric 
and where is an auxiliary flat metric [i.e. = diag(— 1, 1, 1, 1) = 

The Einstein held equations (1) can then be replaced by the relaxed 
equations 



= 



IGttG 



( 3 ) 



where the box operator is the flat d’Alembertian, □ = and 

where the source term is the sum of a matter part and a gravitational part, 



^ \g\T>^'' + . (4) 

In harmonic coordinates the held equations take the form of simple wave 
equations, but whose source term is actually a complicated functional of the 
gravitational held notably the gravitational part depends on and its 
first and second space-time derivatives: 



+ dphP'^d^h'^P + ^g^'^gpadxhP^drh’^^ 

- gPPgardxh''^dph’^^ - g''PgardxhP^dph^^ + gp^g^^ Oxh^POrh''^ 

+ \{2gPPg'^^ - gP''gP^){2gxrge. ~ gr.gx.)dph>'^ . (5) 

The point is that is at least quadratic in h, so the relaxed held equations 
(3) are very naturally amenable to a perturbative non-linear expansion. As 
an immediate consequence of the gauge condition (2), the right side of the 
relaxed equations is conserved in the usual sense, and this is equivalent to 
the equations of motion of matter: 

d^rP'' = 0 V^TP'' = 0 . ( 6 ) 

We refer to as the total stress-energy pseudo-tensor of the matter and 
gravitational fields in harmonic coordinates. Since the harmonic coordinate 
condition is Lorentz covariant, is a tensor with respect to Lorentz trans- 
formations (but of course not with respect to general diffeomorphisms) . 

In order to select the physically sensible solution of the held equations in 
the case of a bounded system, one must choose some boundary conditions 
at infinity, i.e. the famous no-incoming radiation condition, which ensures 
that the system is truly isolated (no radiating sources located at infinity). 
In principle the no-incoming radiation condition is to be formulated at past 
null infinity J~ . Here, we shall simplify the formulation by taking advantage 
of the presence of the Minkowski background gpn to define the no-incoming 
radiation condition with respect to the Minkowskian past null infinity 
Of course, this does not make sense in the exact theory where only exists the 
metric g^^ and where the metric is flctituous, but within approximate 
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(post-Minkowskian) methods it is legitimate to view the gravitational field 
as propagating on the fiat background since does exist at any finite 
order of approximation. 

We formulate the no-incoming radiation condition in such a way that it 
suppresses any homogeneous, regular in IR'^, solution of the d’Alembertian 
equation Dh = 0. We have at our disposal the Kirchhoff formula which ex- 
presses h(x',f') in terms of values of h(x,t) and its derivatives on a sphere 
centered on x' with radius p = |x' — x| and at retarded time t = t' — pjc. 



Mx',0 = / /f 



d , ^ 9 . 



(x,t) 



( 7 ) 



where dfi is the solid angle spanned by the unit direction (x — x') j p. From 
the Kirchhoff formula we obtain the no-incoming radiation condition as a 
limit at that is r — >■ -|-oo with t + rjc =const (where r = |x|). In fact we 
obtain two conditions: the main one, 



liin 

t+r/c=const 






c dt 






(x,f) = 0 , 



( 8 ) 



and an auxiliary condition, that rd\h^'^ should be bounded at coming 
from the fact that p in the Kirchhoff formula (7) differs from r [we have 
p = r — x'.n -I- 0(l/r) where n = x/r]. 

In fact, we adopt in this article a much more restrictive condition of no- 
incoming radiation, namely that the field is stationary before some finite 
instant — T in the past: 



t<-r ^ ^[h'^'^(x,t)] = o . (9) 

In addition we assume that before — T the field h^'^(x) is of order 0(l/r) 
when r — >■ -|-oo. These restrictive conditions are imposed for technical reasons 
following [23] , since they allow constructing rigorously (and proving theorems 
about) the metric outside some time-like world tube r = |x I > TZ. We shall 
assume that the region r > TZ represents the exterior of an actual compact- 
support system with constant radius d < TZ [i.e. d is the maximal radius of 
the adherence of the compact support of T^'^(x,t), for any time tj. 

Now if satisfies for instance (9), so does the pseudo-tensor built 
on it, and then it is clear that the retarded integral of satisfies itself the 
same condition. Therefore one infers that the unique solution of the Einstein 
equation (3) satisfying the condition (9) is 

, ( 10 ) 

where the retarded integral takes the standard form 



(□^V)(x,t) 



1 

47T 




(x',t 



|x-x'|/c) . 



( 11 ) 
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Notice that since depends on h and its derivatives, the equation (10) is to 
be viewed rather as an integro-differential equation equivalent to the Einstein 
equation (3) with no-incoming radiation. 

1.3 Method and General Physical Picture 

We want to describe an isolated system, for instance a “two-body system”, 
in Einstein’s theory. We expect (though this is not proved) that initial data 
sets dig^u, P, v satisfying the constraint equations on the space-like 
hypersurface t = to exist, and that this determines a unique solution of the 
field equations for any time t, which approaches in the case of two bodies 
a “scattering state” when t — > — oo, in which the bodies move on unbound 
(hyperbolic-like) orbits. We assume that the space-times generated by such 
data admit a past null infinity (or, if one uses approximate methods, 
with no incoming radiation. (Note that in a situation with initial scattering 
the field might not satisfy the rigorous definitions of asymptotic flatness at 
J~] see [40“43].) The point to make is that in this class of space-times there 
is no degree of freedom for the gravitational field (we could consider other 
situations where the motion is influenced by incoming radiation). 

Both our technical assumptions of compact support for the matter source 
(with constant radius d) and stationarity before the time —'T contradict 
our expectation that a two-body system follows an unbound orbit in the 
remote past. We do not solve this conflict but argue as follows: (i) these 
technical assumptions permit to derive rigorously some results, for instance 
the expression [given by (52) with (56) below] of the far-held of an isolated 
past-stationary system; (ii) it is clear that these results do not depend on 
the constant radius d, and furthermore we check that they admit in the 
“scattering” situation a well-dehned limit when — T — >■ — oo ; (iii) this makes 
us conhdent that the results are actually valid for a more realistic class of 
physical systems which become unbound in the past and are never stationary 
(and, even, one can give a posteriori conditions under which the limit — T — >■ 
— oo exists for a general system at some order of approximation). 

Suppose that the system is “slowly-moving” [in the sense of (12) below], so 
that we can compute the held inside its compact support by means of a post- 
Newtonian method, say where the overbar refers to the formal 

post-Newtonian series. The post-Newtonian iteration (say, for hydrodynam- 
ics) is not yet dehned to all orders in 1/c, but many terms are known: see the 
works of Lorentz and Droste [20], Einstein, Infeld and Hohmann [21], Fock 
[44], Chandrasekhar and collaborators [45-47], Ehlers and followers [48-54], 
and many other authors [55-58,30,24]. 

On the other hand, outside the isolated system, the held is weak every- 
where and it satishes the vacuum equations. Therefore, the equations can 
be solved conjointly by means of a weak-held or post-Minkowskian expan- 
sion (G — >■ 0), and, for each coefhcient of G” in the latter expansion, by 
means of a multipole expansion (valid because we are outside). The general 
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Multipolar-post-Minkowskian (MPM) metric was constructed in [23,29] as a 
fuirctioiral of two sets of “multipole moments” and S'L(t) which were 

left arbitrary at this stage (i.e. not comrected to the source). The idea of com- 
biniirg the post-Minkowskian aird mriltipole expansions comes from the works 
of Bonnor [59] and Thorne [60]. We denote by the exterior 

solution, where A4 stands for the multipole expansion (as it will turn out, 
the post-Minkowskian expansion appears iir this formalism to be somewhat 
less fmrdamental than the multipole expansion). 

The key assumption is that the two expansions and = 

should match in a region of common validity for both the post- 
Newtoniair and multipole expansions. Here is where our physical restriction 
to slow motioir plays a crucial role, because such an overlap regioir exists 
(this is the so-called exterior irear-zone) if and only if the system is slowly- 
moviirg. The matching is a variairt of the well-kirown method of matching 
of asymptotic expansioirs, very useful iir gravitatioiral radiatioir theory [Gi- 
bb, 30, 66, 24]. It consists of decomposing the inner solution into multipole mo- 
ments (valid in the outside), re-expandiirg the exterior solution in the near 
zone (r/c — > 0), and equating term by term the two resulting expansion se- 
ries. From the requirement of matching we obtain in [25], and review in Sects. 
2 and 3 below, the general formula for the multipole expansion A4(/i^“^) in 
terms of the “source” mriltipole moments (notably a mass-type moment 
and a current- type J^), given as functionals of the post-Newtonian expan- 
sion of the pseudo-tensor, i.e. . [The previous moments Ml and Sl (re- 
ferred below to as “canonical”) are deduced from the source moments after 
a suitable coordinate transformation.] In addition the matching equation de- 
termines the radiation reaction contributions in the inner post-Newtonian 
metric [67,68,32]. 

To obtain the source multipole moments in terms of basic source para- 
meters (mass density, pressure), it remains to replace by the result of an 
explicit post-Newtonian iteration of the inner field. This was done to IPN 
order in [30,66], then to 2PN order in [24], and the general formulas obtained 
in [25] permit recovering these results. See Sect. 6. On the other hand, if 
one needs the equations of motion of the source, simply one inserts the post- 
Newtonian metric into the conservation law = 0. (Note that we are 

speaking of the equations of motion, which take for instance the form of Euler- 
type equations with many relativistic corrections, but not of the solutions of 
these equations, which are typically impossible to obtain analytically.) 

From the harmonic coordinates, one can perform to all post-Minkowskian 
orders [29] a coordinate transformation to some radiative coordinates such 
that the metric admits a far-held expansion in powers of the inverse of the dis- 
tance R (without the powers of Ini? which plague the harmonic coordinates). 
Considering the leading order 1/i? one compares the exterior metric, which is 
parametrized by the source moments (connected to the source via the match- 
ing equation), to the metric dehned with “radiative” multipole moments, say 
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Ul and Vl- This gives Ul and Vl in terms of the source moments, notably 
II and Jl, and a fortiori of the source parameters. This solves, within ap- 
proximate methods, the problem of the relation between the far field and the 
source. The radiative moments have been obtained with increasing precision 
reaching now 3PN [69-71], as reviewed in Sect. 5. 

The previous scheme is developed for a general description of matter, 
however restricted to be smooth (we have in mind a general “hydrodynami- 
cal” Thus the scheme a priori excludes the presence of singularities (no 
“point-particles” or black holes), but this is a serious limitation regarding the 
application to compact objects like neutron stars, which can adequately be 
approximated by point-masses when studying their dynamics. Fortunately, 
the formalism is applicable to a singular involving Dirac measures, at 
the price of a further ansatz, that the infinite self-field of point-masses can be 
regularized in a certain way. By implementing consistently the regularization 
we obtain the multipole moments and the radiation field of a system of two 
point-masses at 2.5PN order [72,73], as well as their equations of motion at 
the same order in the form of ordinary differential equations [74] (the result 
agrees with previous works [16-19]); see Sect. 7. 

2 Multipole Decomposition 

In this section we construct the multipole expansion A4(/i^'^) = of the 
gravitational field outside an isolated system, supposed to be at once self- 
gravitating and slowly-moving. By slowly-moving we mean that the typical 
current and stress densities are small with respect to the energy density, in 
the sense that 



max 





( 12 ) 



where 1/c denotes (slightly abusively) the small post-Newtonian parameter. 
The point about (12) is that the ratio between the size of the source d and 
a typical wavelength of the gravitational radiation is of order d/X = 0{l/c). 
Thus the domain of validity of the post-Newtonian expansion covers the 
source: it is given by r < 6 where the radius b can be chosen so that d <b = 
0{X/c). 



2.1 The Matching Equation 

The construction of the multipole expansion is based on several technical 
assumptions, the crucial one being that of the consistency of the asymptotic 
matching between the exterior and interior fields of the isolated system. In 
some cases the assumptions can be proved from the properties of the exterior 
field /igxt as obtained in [23] by means of a post-Minkowskian algorithm. How- 
ever, since our assumptions are free of any reference to the post-Minkowskian 
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expansion, we prefer to state them more generally, without invoking the ex- 
istence of such an approximation (refer to [25] for the full detailed assump- 
tions). In many cases the assumptions have been explicitly verified at some 
low post-Newtonian orders [30,66,24,73]. 

The field h (skipping space-time indices), solution in of the relaxed 
field equations and the no-incoming radiation condition, is given as the re- 
tarded integral (10). We now assume that outside the isolated system, say, in 
the region r > TZ where 77. is a constant radius strictly larger than d, we have 
h = Ai{h) where A4{h) denotes the multipole expansion of h, a solution of 
the vacuum field equations in deprived from the spatial origin r = 0, and 
admitting a spherical-harmonics expansion of a certain structure (see below) . 
Thus, in 7R X Ml where Ml = M^ — {0}, 



= 0 , (13a) 

. (13b) 

The source term M.{A) is obtained from inserting A4(h) in place of h into 

(5), i.e. M{A) = A{M{h)). [Since the matter tensor has a compact support, 

4 

M{T) = 0 so that M{t) = ^q^q A4{A).] Of course, inside the source (when 
r < d), the true solution h differs from the vacuum solution A4{h), the latter 
becoming in fact singular at the origin (r = 0). We assume that the spherical- 
harmonics expansion of A4{h) in M x Ml reads 

7W(/i)(x,t) = ^ nLr“(lnr)^Li^a,p(t) + -R 7 v(x,i) . (14) 

a<N 

This expression is valid for any N G IN. The powers of r are positive or 
negative, a G Z, and we have a < N (the negative powers of r show that the 
multipole expansion is singular at r = 0). For ease of notation we indicate 
only the summation over a, but there are two other summations involved: 
one over the powers p G IN of the logarithms, and one over the order of 
multipolarity I G IN. The summations are considered only in the sense of 
formal series, as we do not control the mathematical nature of the series. 
The factor fiL is a product of I unit vectors, til = = n*b..n*b where 

L = is a multi-index with I indices, on which the symmetric and trace- 

free (STF) projection is applied: hi = STF[ni]. The decomposition in terms 
of STF tensors hi(0, (p) is equivalent to the decomposition in usual spherical 
harmonics. The functions LFa,p{t) are smooth {C°°) functions of time, which 
become constant when t < — T because of our assumption (9). [Of course, the 
bFa.p's depend also on c : LFa,p{t, c).] Finally the function Rn{x, t) is defined 
by continuity throughout 7R^. Its two essential properties are Rn G {M^) 
and Rn = 0{r^) when r — 0 with fixed t. In addition Rn is zero before 
the time — T. Though the function 7?jv(x,t) is given “globally” (as is the 
multipole expansion), it represents a small remainder 0{r^) in the expansion 
of A4{h) when r — >■ 0, which is to be identified with the “near-zone” expansion 
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of the field outside the source. It is convenient to introduce a special notation 
for the formal near-zone expansion (valid to any order N): 

7W(/i)(x,t) = ^hLr“(lnr)PiFa,p(t) , (15) 

where the summation is to be understood in the sense of formal series. [Note 
that (14) and (15) are written for the field variable A4{h), but it is easy to 
check that the same type of structure holds also for the source term 

Our justification of the assumed structure (14) is that it has been proved to 
hold for metrics in the class of Multipolar-post-Minkowskian (MPM) metrics 
considered in [23], i.e. formal series hext = which satisfy the vacuum 

equations, are stationary in the past, and depend on a finite set of indepen- 
dent multipole moments. More precisely, from the Theorem 4.1 in [23], the 
general MPM metric hext, that we identify in this paper with is 

such that the property (14) holds for the h„’s to any order n, with the only 
difference that to any finite order n the integers a,p,l vary into some finite 
ranges, namely amin(?T-) <a<N,0<p<n — I and 0 < I < Imaxiji), with 
<^min(n) —oo and Imax(n) — >■ +00 when n — >■ -l-oo. The functions L^a,p and 
the remainder Rjy in (14) should therefore be viewed as post-Minkowskian se- 
ries ^ G’^Lf'a,p,n and ^ G'^Rn,ti- What we have done in writing (14) and (15) 
is to assume that one can legitimately consider such formal post-Minkowskian 
series. Note that because the general MPM metric represents the most gen- 
eral solution of the field equations outside the source (Theorem 4.2 in [23]), 
it is quite appropriate to identify the general multipole expansion Aifih) with 
the MPM metric h-ext- Actually we shall justify this assumption in Sect. 5 by 
recovering from step by step in the post-Minkowskian expansion, the 

MPM metric hext- Because the properties are proved in [23] for any n, and 
because we consider the formal post-Minkowskian sum, we see that (14)-(15), 
viewed as if it were “exact”, constitutes a quite natural assumption. In par- 
ticular we have assumed in (14)-(15) that the multipolar series involves an 
infinite number of independent multipoles. In summary, we give to the prop- 
erties (14)-(15) a scope larger than the one of MPM expansions (maybe they 
could be proved for exact solutions), at the price of counting them among 
our basic assumptions. 

The multipole expansion M.{h) is a mathematical solution of the vacuum 
equations in JR x IR^, but whose “multipole moments” (the functions LFa,p) 
are not determined in terms of the source parameters. When the isolated 
system is slowly moving in the sense of (12), there exists an overlapping 
region between the domains of validity of the post-Newtonian expansion: the 
“near-zone” r < b, where d < b = 0(A/c), and of the multipole expansion: 
the exterior zone r > TZ. For this to be true it suffices to choose TZ, which is 
restricted only to be strictly larger than d, such that d < TZ < b. We assume 
that the field h given by (10) admits in the near-zone a formal post-Newtonian 
expansion, h = h when r < b. On the other hand, recall that h = Ai{h) 
when r > TZ. Matching the two asymptotic expansions h and Ai{h) in the 
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“matching” region TZ < r < b means that the (formal) double series obtained 
by considering the multipole expansion of (all the coefficients of) the post- 
Newtonian expansion h is identical to the double series obtained by taking 
the near-zone expansion of the multipole expansion. [We use the same overbar 
notation for the post-Newtonian and near-zone expansions because the near- 
zone expansion (r/c — >■ 0) of the exterior multipolar field is mathematically 
equivalent to the expansion when c — >• oo with fixed multipole moments.] The 
resulting matching equation reads 



M{h) = M{h). (16) 

This equation should be true term by term, after both sides of the equation 
are re-arranged as series corresponding to the same expansion parameter. 
Though looking quite reasonable (if the theory makes sense), the matching 
equation cannot be justified presently with full generality; however up to 2PN 
order it was shown to determine a unique solution valid everywhere inside 
and outside the source [30,66,24]. The matching assumption complements 
the framework of MPM approximations [23], by giving physical “pith” to the 
arbitrary multipole moments used in the construction of MPM metrics (see 
Sect. 4). 

2.2 The Field in Terms of Multipole Moments 

Let us consider the relaxed vacuum Einstein equation (13b), whose source 
term A4(T), according to our assumptions, owns the structure (14) [recall 
that (14) applies to A4{h) as well as M.{A)]. We obtain a particular soiwtion of 
this equation (in IR x M^) as follows. First we multiply each term composing 
M.{A) in (14) by a factor (r/rg)'®, where B is a complex number and rg 
a constant with the dimension of a length. For each term we can choose 
the real part of B large enough so that the term becomes regular when 
r — >■ 0, and then we can apply the retarded integral (11). The resulting 
i3-dependent retarded integral is known to be analytically continuable for 
any B G C except at integer values including in general the value of interest 
B = 0. Furthermore one can show that the finite part (in short FPs=c) of this 
integral, defined to be the coefficient of the zeroth power of B in the expansion 
when B > 0, is a retarded solution of the corresponding wave equation. In the 
case of a regular term in (14) such as the remainder i? 7 v, this solution simply 
reduces to the retarded integral. Summing all these solutions, corresponding 
to all the separate terms in (14), we thereby obtain as a particular solution 
of (13b) the object FP^^g □jj^[(r/ro)^A4(d)[. This is basically the method 
employed in [23] to solve the vacuum field equations in the post-Minkowskian 
approximation. 

Now all the problem is to find the homogeneous solution to be added to 
the latter particular solution in order that the multipole expansion M.{h) 
matches with the post-Newtonian expansion h, solution within the source 
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of the field equation (3) [or, rather, (10)]. Finding this homogeneous solu- 
tion means finding the general consequence of the matching equation (16). 
The result [24,25] is that the multipole expansion satisfying the Einstein 
equation (10) together with the matching equation (16) reads 



where the first term is the previous particular solution, and where the second 
term is a retarded solution of the source-free (homogeneous) wave equation, 
whose “multipole moments” are given explicitly by {u = t — r /c) 



Here denotes the post-Newtonian expansion of the stress-energy pseudo- 
tensor appearing in the right side of (10). In (17) and (18) we denote 
L = ii.. ,ii and 8l = di^. .. , xl = . . . Xi^ . 

It is important that the multipole moments (18) are found to depend 
on the post- Newtonian expansion of the pseudo-tensor, and not of 
itself, as this is precisely where our assumption of matching to the inner 
post-Newtonian field comes in. The formula is a priori valid only in the 
case of a slowly-moving source; it is a priori true only after insertion of a 
definite post-Newtonian expansion of the pseudo-tensor, where in particular 
all the retardations have been expanded when c — >■ oo [the formulas (17)-(18) 
assume implicitly that one can effectively construct such a post-Newtonian 
expansion] . 

Like in the first term of (17), the moments (18) are endowed with a finite 
part operation defined by complex analytic continuation in B. Notice however 
that the two finite part operations in the first term of (17) and in (18) act 
quite differently. In the first term of (17) the analytic continuation serves at 
regularizing the singularity of the multipole expansion at the spatial origin 
r = 0. Since the pseudo-tensor is smooth inside the source, there is no need in 
the moments (18) to regularize the field near the origin; still the finite part is 
essential because it applies to the bound of the integral at infinity (|x| — >■ oo). 
Otherwise the integral would be (a priori) divergent at infinity, because of 
the presence of the factor = 0(r^) in the integrand, and the fact that the 
pseudo-tensor is non-compact supported. The two finite parts present in 
the two separate terms of (17) involve the same arbitrary constant tq, but 
this constant can be readily checked to cancel out between the two terms [i.e. 
the differentiation of with respect to r^ yields zero]. 

The formulas (17)-(18) were first obtained (in STF form) up to the 2PN 
order in [24] by performing explicitly the matching. This showed in particular 
that the matching equation (16) is correct to 2PN order. Then the proof valid 
to any post-Newtonian order, but at the price of assuming (16) to all orders. 




iy-Htit-r/c)) 



(17) 




(18) 
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was given in Sect. 3 of [25] (see also Appendix A of [25] for an alternative 
proof). The crucial step in the proof is to remark that the finite part of 
the integral of A4(A) over the whole space is identically zero by analytic 
continuation: 



FPb^o [ \x/ro\^XLM{A){:s.,u) = 0 . (19) 

This follows from the fact that A1(A) can be written as a formal series of 
the type (15). Using (15) it is easy to reduce the computation of the integral 
(19) to that of the elementary radial integral (since the 

powers of the logarithm can be obtained by repeatedly differentiating with 
respect to B). The latter radial integral can be split into a “near-zone” inte- 
gral, extending from zero to radius TZ, and a “far-zone” integral, extending 
from TZ to infinity (actually any finite non-zero radius fits instead of TZ). 
When the real part of i? is a large enough positive number, the value of the 
near-zone integral is / {B -|- 3 -I- Z -I- a), while when the real part 

of i? is a large negative number, the far-zone integral reads the opposite, 
—T^B+z+i+a ! ^B -I- 3 -I- / -I- a) . Both obtained values represent the unique an- 
alytic continuations of the near-zone and far-zone integrals for any B £ C 
except —3 — I — a. The complete integral djxj jxj®+^+^+“ is defined as 
the sum of the analytic continuations of the near-zone and far-zone integrals, 
and is therefore identically zero (Vi? € C); this proves (19). 

One may ask why the whole integration over TR^ contributes to the mul- 
tipole moment (18) - a somewhat paradoxical fact because the integrand 
is in the form of a post-Newtonian expansion, and is thus expected to be 
physically valid (i.e. to give accurate results) only in the near zone. This fact 
is possible thanks to the technical identity (19) which enables us to trans- 
form a near-zone integration into a complete IR^-integration (refer to [25] for 
details) . 

2.3 Equivalence with the Will Wiseman Multipole Expansion 

Recently a different expression of the multipole decomposition, with correla- 
tively a different expression of the multipole moments, was obtained by Will 
and Wiseman [75], extending previous work of Epstein and Wagoner [76] and 
Thorne [60] . Basically, the multipole moments in [75] are defined by an inte- 
gral extending over a ball of finite radius TZ (essentially the same TZ as here) , 
and thus do not require any regularization of the bound at infinity. By con- 
trast, our multipole moments (18) involve an integration over the whole TR^, 
which is allowed thanks to the analytic continuation [leading to the identity 
(19)]. Let us outline the proof of the equivalence between the Will-Wiseman 
formalism [75] and the present one [24,25]. 

Will and Wiseman [75] find, instead of (17)-(18), 
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M{hn = ^R\M{An]\n-^^^-^dA\wl''{t-r/c)\ . (20) 

i=o ■ ^ ^ 

The first term is given by the retarded integral (11) acting on M.{A), but 
truntated, as indicated by the subscript TZ, to extend only in the “far zone” : 
|x'| > 7?. in the notation (11). Thus, the near-zone part of the retarded 
integral, which contains the source, is removed, and there is no problem with 
the singularity of the multipole expansion at the origin. Then, the multipole 
moments Wl are given by an integral extending over the “near zone” only: 

W^‘'{u)= f . ( 21 ) 

7|x|<7?, 

The integral being compact-supported is well-defined. The multipole mo- 
ments Wi look technically more simple than ours given by (18). On the 
other hand, practically speaking, the analytic continuation in (18) permits 
deriving many closed-form formulas to be used in applications [72,77]. Of 
course, one is free to choose any definition of the multipole moments as far 
as it is used in a consistent manner. 

We compute the difference between the moments TLl and Wl- For the 
comparison we split 77 l into far-zone and near-zone integrals corresponding 
to the radius TZ. Since the analytic continuation factor in TTl deals only with 
the bound at infinity, it can be removed from the near-zone integral, which 
is then clearly seen to agree with Wl- So the difference Hl — y^L is given by 
the far-zone integral: 

■Hi ( m) - Wl(u) = FPb^o / |x/ro|®XLr(x,it) . (22) 

d|x|>7?. 

Next we transform the integrand. Successively we write r = M(t) because 
we are in the far zone; A4{t) = A4{t) from the matching equation (16); 

4 

and M{t) = because T has a compact support. At this stage, 

the technical identity (19) allows one to transform the far-zone integration 
into a near zone integration (changing simply the overall sign in front of the 
integral). So, 



4 p 

77l(u) -Wl{u) = ~-^-^FPb=o / \x/ro\^XLM{A){x,u) . (23) 

IGttG d|x|<-R 

It is straightforward to check that the right side of this equation, when 
summed up over all multipolarities I, accounts exactly for the near-zone part 
that was removed from the retarded integral of M.{A) [first term in (20)], so 
that the “complete” retarded integral as given by the first term in (17) is 
exactly reconstituted. In conclusion the two formalisms [24,25] and [75] are 
equivalent. 
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3 Source Multipole Moments 

Quite naturally our source multipole moments will be closely related to the 
obtained in (18). However, before giving a precise definition, we need 
to find the equivalent of the multipole decomposition (17)-(18) in terms of 
symmetric and trace- free (STF) tensors, and we must reduce the number of 
independent tensors by imposing the harmonic gauge condition (13a). This 
leads to the definition of a “linearized” metric associated with the multi- 
pole expansion and parametrized by six sets of STF source multipole 

moments. 

3.1 Multipole Expansion in Symmetric Trace-Free Form 

The moments 'Hl given by (18) are non-trace-free because xl owns all its 
traces (i.e. = x^xl_ 2 , where L — 2 = Instead of Hl, there 

are certain advantages in using STF multipole moments: indeed the STF 
moments are uniquely defined, and they often yield simpler computations in 
practice. It is not difficult, using STF techniques, to obtain the multipole 
decomposition equivalent to (17)-(18) but expressed in terms of STF tensors. 
We find 



. „ -1-00 .W j- . 

M{hn = FPb=o D~^^[{r/ro)^M{An] ~ ^ E ~ 

^ 1=0 ' ^ ^ 

(24) 

where the STF multipole moments are given by [25] 

=FPb=o J \^/ro\^XL J dz 5 i{z)t^'' { x,u + z\x\/c) . (25) 

The notation for a STF product of vectors is xl = STF(xl) (such that xl 
is symmetric in L and = 0; for instance Xij = XiXj — As 

we see, the STF moments (25) involve an extra integration, over the variable 
z, with respect to the non-STF ones (18). The weighting function associated 
with the z-integration reads, for any I, 

Si{z)=^;^{l-zy ; j'^dzS,{z) = l . (26) 

In the limit of large / the weighting function tends toward the Dirac delta 
measure (hence its name): lim;_,.oo Si = S. Remark that since (25) is valid only 
in the post-Newtonian approximation, the z-integration is to be expressed as 
a post-Newtonian series. Here is the relevant formula [30] : 
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dz <5;(z)r(x, u 



2 |x|/c) 



^^2^k\{2l + 2k+l)\\\ c du) ^ 



(27) 

In the limiting case of linearized gravity, one can neglect the first term in (24), 
and the pseudo-tensor in (25) can be replaced by the matter stress-energy 
tensor (we have inside the slowly-moving source). Since 

has a compact support the finite part prescription can be removed, and 
we recover the known multipole decomposition corresponding to a compact- 
support source (see the appendix B in [30]). 



3.2 Linearized Approximation to the Exterior Field 

Up to now we have solved the relaxed field equation (10) in the exterior zone, 
with result the multipole decomposition (24)-(25). In this section we further 
impose the harmonic gauge condition (13a), and from this we find a solution 
of the linearized vacuum equation, appearing as the first approximation in a 
post-Minkowskian expansion of the multipole expansion M.{h). 

Let us give a notation to the first term in (24): 

= FPs=o U~\r/ro)^M{An\ ■ (28) 

Applying on (24) the condition = 0, we find that the divergence 

is equal to a retarded solution of the source-free wave equation, 

given by 



”'‘ = ^3-(£Tf«'.{;^n<--'/c)}) ■ (29) 

Now, associated to any of this type, there exists some which is like 
a retarded solution of the source-free wave equation, = 0, and 

furthermore whose divergence is the opposite of = —w^. We refer 

to [23,70] for the explicit formulas allowing the “algorithmic” construction 
of once we know . For definiteness, we adopt the formulas (2.12) in 
[70], which represent themselves a slight modification of the earlier formulas 
(4.13) in [23] (see also the appendix B in [25]). 

With at our disposal we define what constitutes the linearized ap- 
proximation to the exterior metric, say Gh^'^ where we factorize out G in 
front of the metric in order to emphasize its linear character: 



( 30 ) 
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The linearized metric satisfies the linearized vacuum equations in harmonic 
gauge: = 0 since both terms in (30) satisfy the source-free wave equa- 
tion, and = 0 thanks to (29) and = —w^. Using the definition 

(30) one can re-write the multipole expansion of the exterior field as 

. (31) 

Quite naturally the and will represent the non-linear corrections to 
be added to the “linearized” metric in order to reconstruct the complete 
exterior metric (see Sect. 4). 

Since satisfies Uh^'^ = 0 = , there is a unique way to decompose 

it into the sum of a “canonical” metric introduced by Thorne [60] (see also 
[23]) plus a linearized gauge transformation, 

K'' = KLi + ■ (32) 

The canonical linearized metric is defined by 

^ l>0 ■ ^ 

i>i ■ ^ ^ 

+ (UftL-llu)]) I , (33b) 

/>2 ■ ^ ^ 

+ ^ _|_ 2 ^ ^aL -2 ) (33c) 

where the fy’s and J^s are two sets of functions of the retarded time u = 
t—r/c [the subscript (n) indicates n time derivatives], and which are STF with 
respect to all their indices L = i\ . . ,ii (the symmetrization is denoted with 
parenthesis). As for the gauge vector t/sfy it satisfies = 0 and depends in 
a way similar to (33) on four other sets of STF functions of u, denoted Wl, 
Xl, Yl and (one type of function for each component of the vector). See 
[25] for the expression of [Mfy, A/,, Ify, Z^]. 

3.3 Derivation of the Source Multipole Moments 

The two sets of multipole moments fy and Jl parametrizing the metric (33) 
constitute our definitions for respectively the mass-type and current-type 
multipole moments of the source. Actually, there are also the moments Wl, 
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Xl, Yl, Zl, and we refer collectively to {/l, Jl, VPl, as the set 

of six source multipole moments. 

With (32) it is easily seen (because = 0) that the gauge condition 
= 0 imposes no condition on the source moments except the conser- 
vation laws appropriate to the gravitational monopole I (having I = 0) and 
dipoles li, Ji {I = 1): namely, 

/(i) = 0 ; /f ^ = 0 ; ^ = 0 . (34) 

The mass monopole I and current dipole Ji are thus constant, and agree 
respectively with the ADM mass and total angular momentum of the isolated 
system (later we shall denote the ADM mass hy M = I). According to (34) 
the mass dipole A is a linear function of time, but since we assumed that the 
metric is stationary in the past, A is in fact also constant, and equal to the 
(ADM) center of mass position. 

The expressions of II and Jl (as well as of the other moments TYl, Xl, Yl, 
Zi^) come directly from (30) with (32)-(33) and the result of the matching, 
which is personified by the formula (25). To simplify the notation we define 




y. . = 

' 5 

(where r** = <5yT*^'). The result is [25] 



(35a) 

(35b) 

(35c) 



4(2/ -h 1) 






+ e^(i + 1W + 2){21 + 5) }(^. . + »|X|/C) , (36a) 

Jl{u) = £ab<iiFPB=0 J d^xjx/rol^y dz^^SiXL-l>a^b 

- ^ 2(^1 + 2){21 + 3) ^i+i^L-i>acdtYbc^{x, u + 2 :|x|/c) , (36b) 



(<> refers to the STF projection). In a sense these expressions are exact, 
since they are formally valid up to any post-Newtonian order. [See (68)-(69) 
below for explicit formulas at 2PN.] 

By replacing in (36) by the compact-support matter tensor we 
recover the expressions of the multipole moments worked out in linearized 
gravity by Damour and Iyer [78] (see also [79]). On the other hand the for- 
mulas (36) contain the results obtained by explicit implementation (“order 
by order” ) of the matching up to the 2PN order [24] . 
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4 Post-Minkowskian Approximation 

In linearized gravity, the source miiltipole moments represent also the mo- 
ments which are “measured” at infinity, using an array of detectors sur- 
rounding the source. However, in the non-linear theory, the gravitational 
source cannot be neglected and the first term in (24) plays a crucial 
role, notably it implies that the measured miiltipole moments at infinity dif- 
fer from the source moments. Thus, we must now supplement the formulas 
of the source multipole moments (36) by the study of the “non-linear” term 

in (24). For this purpose we develop fol- 
lowing [23] a post-Minkowskian approximation for the exterior vacuum met- 
ric. 



4.1 Multipolar Post-Minkowskian Iteration of the Exterior Field 

The work started already with the formulas (31)-(33), where we expressed 
the exterior multipolar metric as the sum of the “linearized” 

metric and the “non-linear” corrections given by (28), and v^'^, 

algorithmically constructed from [see (29)]. The linearized metric 

is a functional of the source multipole moments: hi = hi [/, J, W, X, Y, Z] . We 
regard G as the book-keeping parameter for the post-Minkowskian series, 
and consider that Ghi is purely of first order in G, and thus that hi itself is 
purely of zeroth order. Of course we know from the previous section that this 
is untrue, because the source miiltipole moments depend on G; supposing 
hi = 0{G^) is simply a convention allowing the systematic implementation 
of the post-Minkowskian iteration. 

Here we check that the non-linear corrections and in (31) gen- 
erate the whole post-Minkowskian algorithm of [23]. The detail demanding 
attention is how the post-Minkowskian expansions of u^’’' and are related 
to a splitting of the gravitational source into successive non-linear terms. 
Let us pose, with obvious notation, 

A^^'' = m''[h, h] + h, h] + 0{h^) , (37) 

where, from the exact formula (5), the quadratic-order piece reads (all indices 
being lowered with the Minkowski metric, and h denoting rjP'^hp^): 



m‘'[h,h] 



- hP^dl^h^^'^ + ^dPhp^d’'hP'^ - ^dPhd'^h 

- dPhp^dPh’'^ - d''hp,dPhP’^ + d^hPP{d^h'^p + dph'''^) 






-\d\hp„d^hP'^ + \dphdPh + \dph„xd'^hP^ 
A P 8 2 P 



, (38) 



and where the cubic-order piece M[h, h, h] and all higher-order terms can be 
obtained in a straightforward way. 
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First, reasoning ad absurdio, we prove (see [25] for details) that both u 
and V indeed represent non-linear corrections to the linearized metric since 
they start at order G^: u = + O(G^) and v = G‘^V2 + 0 {G^). Next we 

obtain explicitly U 2 by substituting the linearized metric h\ into (38) and 
applying the finite part of the retarded integral, i.e. 

= FPs=onfl'|(^Ao)^iV'^"[/ii,/*i]| . (39) 

In this way we have a particular solution of the wave equation in IR x IRj, 
Uu 2 = N[hi,hi\. From U 2 one deduces V 2 by the same “algorithmic” equa- 
tions as used when deducing v from u [see after (29)]. Then Ov2 = 0 and the 
sum U 2 + V 2 is divergenceless, so we can solve the quadratic-order vacuum 
equations in harmonic coordinates by posing 

= + (40) 

With this definition it is clear that the multipole expansion (31) reads to 
quadratic order: 



= Gh^’' + G^ht^'' + O(G^) . (41) 

Continuing in this fashion to the next order we find successively 



uf = FPs=onfl'|(r/ro)^(^M'^"[hi, hi, hi] + iV^"[hi, h2] + iV^"[h2, hi]^|; 

(42a) 

hr = ur+vr; (42b) 

= Gh‘1'' + G^ht^'" + G^h^‘' + O(G^) . (42c) 

This process continues ad infinitum. The latter post-Minkowskian algorithm 
is exactly the one proposed in [23] (see also Sect. 2 of [70]). That is, starting 
from hi[I,J,W,X,Y,Z] given by (32)-(33), one generates the infinite post- 
Minkowskian (MPM) series of [23] , solving the vacuum (harmonic-coordinate) 
Einstein equations in JR x IRl, and this formal series happens to be equal, 
term by term in G, to the general miiltipole decomposition of given by 
(24). For any n, we have + v^'', and 



+00 

7W(h^") = ^G"h([" . (43) 

n— 1 

This result is perfectly consistent with the fact that the MPM algorithm 
generates the most general solution of the field equations in M x Fur- 
thermore, the latter post-Minkowskian approximation is known [26] to be 
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reliable (existence of a one-parameter family of exact solutions whose Taylor 
expansion when G — >■ 0 reproduces the approximation) - an interesting result 
which indicates that the multipole decomposition A4(h) given by (24)-(25) 
might be proved within a context of exact solutions. 

Recall that the source multipole moments II, Jl, Wl, X^, Y^, enter- 
ing the linearized metric hi at the basis of the post-Minkowskian algorithm 
are given by formulas like (36). Thus, in the present formalism, the source 
moments, including formally all post-Newtonian corrections [and all possible 
powers of G] as contained in (36), serve as “seeds” for the post-Minkowskian 
iteration of the exterior field, which as it stands leads to all possible non- 
linear interactions between the moments. As we can imagine, rapidly the 
formalism becomes extremely complicated when going to higher and higher 
post-Minkowskian and/or post-Newtonian approximations. Most likely the 
complexity is not due to the formalism but reflects the complexity of the 
field equations. It is probably impossible to find a different formalism in 
which things would be much simpler (except if one restricts to a particular 
type of source) . 

4.2 The “Canonical” Multipole Moments 

The previous post-Minkowskian algorithm started with hi, a functional of 
six types of source multipole moments, II and Jl entering the “canonical” 
linearized metric /icani given by (33), and W^, Y^, Zj^ parametrizing the 

gauge vector in (32). All these moments deserve their name of source mo- 
ments, but clearly the moments VFl, A^, Yj^ and Zj, do not play a physical 
role at the level of the linearized approximation, as they simply parametrize a 
linear gauge transformation. But because the theory is covariant with respect 
to (non-linear) diffeomorphisms and not merely to linear gauge transforma- 
tions, these moments do contribute to physical quantities at the non-linear 
level. 

In practice, the presence of the moments Wl, Xl, Yl, Zl complicates the 
post-Minkowskian iteration. Fortunately one can take advantage of the fact 
(proved in [23]) that it is always possible to parametrize the vacuum metric by 
means of two and only two types of multipole moments Mj^ and S ^ (different 
from II and Jl). The metric is then obtained by the same post-Minkowskian 
algorithm as in (39)-(43), but starting with the “canonical” linearized metric 
^cani [M, S'] instead of hi [I, J, W, X, Y, Z] . The resulting non-linear metric 
/lean is isometric to our exterior metric /lext = AI(/i), provided that the 
moments Ml and S l are given in terms of the source moments II, Jl, ■ ■ ■ , Zl 
by some specific relations 



Ml = Ml[I,J,W,X,Y,Z] , 
Sl = Sl[I,J,W,X,Y,Z] . 



(44a) 

(44b) 
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The two coordinate systems in which hcan and /lext are defined satisfy the 
harmonic gauge condition in the exterior zone, but (probably) only the one 
associated with hext meshes with the harmonic coordinates in the interior 
zone. With the notation (32) the coordinate change reads Sx^ = G(pi+ non- 
linear corrections. We shall refer to the moments M/, and as the mass- 
type and current-type canonical multipole moments. Of course, since at the 
linearized approximation the only “physical” moments are II and Jl, we 
have 



where 0{G) denotes the post-Minkowskian corrections. Furthermore, it can 
be shown [73] that in terms of a post-Newtonian expansion the difference 
between both types of moments is very small: 2.5PN order, i.e. 



[note that M = Madm = !]■ Thus, from (46), the canonical moments are 
only “slightly” different from the source moments. Their usefulness is merely 
practical - in general they are used in place of the source moments to simplify 
a computation. 

4.3 Retarded Integral of a Multipolar Extended Source 

The previous post-Minkowskian algorithm has only theoretical interest unless 
we supply it with some explicit formulas for the computation of the coeffi- 
cients hn- Happily for us pragmatists, such formulas exist, and can be found 
in a rather elegant way thanks to the process of analytic continuation. Ba- 
sically we need the retarded integral of an extended (non-compact-support) 
source with a definite multipolarity 1. Here we present three exemplifying 
formulas; see the appendices A in [70] and [71] for more discussion. 

Very often we meet a wave equation whose source term is of the type 
fiLF{t — r lc)lr^ , where fi^ has multipolarity I and F denotes a certain prod- 
uct of multipole moments. [Clearly, the near-zone expansion of such a term is 
of the form (15).] When the power k is such that 3 < k < 1 + 2 (this excludes 
the scalar case / = 0), we obtain the solution of the wave equation as [23,68] 



Ml = Il + 0{G) , 
Sl = Jl + 0{G) , 



(45a) 

(45b) 




(46) 



{r/ro)^^F{t- r/c) 




k-3 



(A:-3)!(/ + 2- A:)!. 
“ {l + k-2)\ 

(I + j)\ {t — r / c) 



( 47 ) 
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As we see the (finite part of the) retarded integral depends in this case on 
the values of the extended source at the same retarded time t — r/c (for 
simplicity we use the same notation for the source and field points). But it 
is well known (see e.g. [80,81]) that this feature is exceptional; in most cases 
the retarded integral depends on the whole integrated past of the source. A 
chief example of such a “hereditary” character is the case with fc = 2 in the 
previous example, for which we find [68,69] 

= -^ j dsF{s/c)Qi( ^^ ^ (48) 

where Qi denotes the Legendre function of the second kind, related to the 
usual Legendre polynomial Pi by the formula 

Qi{x) = ^Pi{x)ln ■ ( 49 ) 

Since the retarded integral (48) is in fact convergent when r — 0, we have 
removed the factor (r/ro)® and finite part prescription. When the source 
term itself is given by a “hereditary” expression such as the right side of 
(48), we get a more complicated but still manageable formula, for instance 
[71] 



□ 



R 



riL 



F{t — r/c) 



□ 



-1 



^ / dsF{s/c)Qf 



ct — s 




(50) 

where F^ denotes that anti-derivative of F which is zero in the past [from 
(9) we have restricted F to be zero in the past], and where 



Rip{x) = Qi{x) dyQp{y)^{y) + Pi{x) J dyQp{y)'^{y) . (51) 

Like in (48) we do not need a finite part operation. The function Rip is 
well-defined thanks to the behaviour of the Legendre function at infinity: 
Qi{x) ~ when x — >■ oo. 

The formulas (48)-(51) are needed to investigate the so-called tails of 
gravitational waves appearing at quadratic non-linear order, and even the 
tails generated by the tails themselves (“tails of tails”) which arise at cubic 
order [69,71]. (These formulas do not show a dependence on the constant tq, 
but other formulas do.) 

5 Radiative Multipole Moments 

In Sect. 2 we introduced the definition of a set of multipole moments 
{Il,Jl,Wl,Xl,Yl,Zl} for the isolated source, and in Sect. 3 we showed 
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that the exterior field, and in particular the asymptotic field therein, is 
actually a complicated non-linear functional of the latter moments. There- 
fore, to define some source multipole moments is not sufficient by itself; 
this must be completed by a study of the relation between the adopted 
definition and some convenient far- field observables. The same is true of 
other definitions of source moments in different formalisms, such as in 
the Dixon local description of extended bodies [82-84], which should be 
completed by a connection to the far-zone gravitational field, for instance 
along the line proposed by [85,86] in the case of the Dixon moments. In the 
present formalism, the connection rests on the relation between the so-called 
radiative multipole moments, denoted and 4/,, and the source moments 
II, Jl,- ■ Zl [in fact, for simplicity’s sake, we prefer using the two moments 
Ml and Sl instead of the more basic six source moments]. 

5.1 Definition and General Structure 

The radiative moments Ul (mass-type) and Vl (current-type) are the co- 
efficients of the multipolar decomposition of the leading 1/i? part of the 
transverse-tracefree (TT) projection of the radiation field in radiative coor- 
dinates (T, X) (with R = |X| the radial distance to the source). Radiative 
coordinates are such that the metric coefficients admit an expansion when 
i? — >■ oo in powers of 1 /R (no logarithms of R) . In radiative coordinates the 
retarded time T — R/c is light-like, or becomes asymptotically light-like when 
i? — >■ oo. By definition, 



hl^{y.,T) = NL-2UabL-2 



l>2 



21 



c{l + l)' 



^cL-2£cd{aV^b)dL-2 f + ^ ( 



where Ni = / R, Nl -2 = Ni„ . . . Nl_^, Ncl -2 = XcNl- 2 , and the TT al- 

gebraic projector reads V,jab = {Sia-N,Na){Sjb-NjNb)-^{6ij-NiNj){Sab- 
NaNb). The radiative moments Ul and Vl depend on T — R/c; from (52) 
they are defined W I > 2. The radiative-coordinate retarded time differs from 
the corresponding harmonic-coordinate time by the well-known logarithmic 
deviation of light cones. 



T-^=f-^-?^ln 

C C 



-]+0{G^) , 

ro,' 



(53) 



where we have introduced in the logarithm the same constant rg as in (39) 
(this corresponds simply to a choice of the origin of time in the far zone). 

Now from the post-Minkowskian algorithm of Sect. 3, it is clear that the 
radiative moments Ul and Vl can be obtained to any post-Minkowskian order 
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in principle, in the form of a non-linear series in the source or equivalently the 
canonical multipole moments Ml and Sl- The practical detail (worked out 
in [29]) is to determine the transformation between harmonic and radiative 
coordinates, generalizing (53) to any post-Minkowskian order. The structure 
of e.g. the mass-type radiative moment is 



Ul 



M 



(0 



+ 00 

E: 

n—2 



,3(n-l)+2fc^"'^ ■ 



(54) 



The first term comes from the fact that the radiative moment reduces at the 
linearized approximation to the (Zth time derivative of the) source or canoni- 
cal moment. The second term represents the series of non-linear corrections, 
each of them is given by a certain X^l which is a n-linear functional of deriva- 
tives of multipole moments Ml or Sl- Furthermore we know from e.g. (48) 
and (50) that each new non-linear iteration (which always involves a retarded 
integral) brings a priori a new “hereditary” integration with respect to the 
previous approximation. So we expect that X„l is of the form (JJ = T — R/c) 



XnL{U)='^f dui...f dUnZn(U,Ui,...,Un)M]^^\ui)...S^^^\un) 

J —CO j —CO 

(55) 

where Z„ denotes a certain kernel depending on time variables U.,ui, . . . , rt„, 
and where the sum refers to all possibilities of coupling together the n mo- 
ments. [See (56) below for examples of kernels Z 2 and Z 3 .] A useful infor- 
mation is obtained from imposing that Zn be dimensionless; this yields the 
powers of G and 1/c in front of each non-linear term in (54), where k is the 
number of contractions among the indices present on the n moments (the 
current moments carrying their associated Levi-Civita symbol) . 

As an example of application of (54) let us suppose that one is interested 
in the 3PN or 1/c® approximation. From (54) we have 3(n — 1) -I- 2A: = 6, and 
we deduce that the only possibility is n = 3 (cubic non-linearity) and k = 0 
(no contractions between the moments). From this we infer immediately that 
the only possible multipole interaction at that order is between two mass 
monopoles and a multipole, i.e. M x M x Ml- This corresponds to the “tails 
of tails” computed explicitly in (56) below. 



5.2 The Radiative Quadrupole Moment to 3PN Order 

To implement the formula (54) a tedious computation is to be done, following 
in details the post-Minkowskian algorithm of Sect. 4 augmented by explicit 
formulas such as (47)-(51), and changing the coordinates from harmonic to 
radiative according to the prescription in [29]. Here we present the result of 
the computation of the mass-type radiative quadrupole {I = 2) up to the 
3PN order: 
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(56) 



Recall that in this formula the moment Mij is the canonical moment which 
agrees with the source moment up to a 2.5PN term [see (46)], and that the 
source moment itself is given in terms of the pseudo-tensor of the source 
by (36a). See also the formulas (68)-(69) below for a more explicit expression 
of the source moment at the 2PN order [of course, to be consistent, one should 
use (56) conjointly with 3PN expressions of the source moments]. 

The “Newtonian” term in (56) corresponds to the quadrupole formalism. 
Next, there is a quadratic non-linear correction with multipole interaction 
M X Mij representing the dominant effect of tails (scattering of linear waves 
off the space-time curvature generated by the mass M). This correction, com- 
puted in [69], is of order \/(? or 1.5PN and has the form of a hereditary inte- 
gral with logarithmic kernel. The constant 11/12 depends on the coordinate 
system chosen to cover the source, here the harmonic coordinates; for in- 
stance the constant would be 17/12 in Schwarzschild-like coordinates [87,88]. 
The next correction, of order 1/c^ or 2.5PN, is constituted by quadratic in- 
teractions between two mass quadrupoles, and between a mass quadrupole 
and a constant current dipole [70]. This term contains a hereditary integral, 
of a type different from the tail integral, which is due to the gravitational ra- 
diation generated by the stress-energy distribution of linear waves [89-91,69]. 
Sometimes this integral is referred to as the non-linear memory integral be- 
cause it corresponds to the contribution of gravitons in the so-called linear 
memory effect [92]. The non-linear memory integral can easily be found by 
using the effective stress-energy tensor of gravitational waves in place of the 
right side of (3); it follows also from rigorous studies of the field at future 
null infinity [93,94]. Finally, at 3PN order in (56) appears the dominant cu- 
bic non-linear correction, corresponding to the interaction M x M x Mij and 
associated with the tails of tails of gravitational waves [71]. 



5.3 Tail Contributions in the Total Energy Flux 

Observable quantities at infinity are expressible in terms of the radiative mass 
and current multipole moments. For instance the total gravitational-wave 
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power emitted in all spatial directions (total gravitational flux or “luminosity” 
C) is given by the positive-deflnite multipolar series 




(^ + l)(^ + 2) jTC^)rj{l) 

l{l - 1)U{21 + iy.\ ^ ^ 

4l{l + 2) 

^c2(/- 1)(/ + 1)!(2/ + 1)!! 



^(1)^(1) 



(57) 



In the case of inspiralling compact binaries (a most prominent source of grav- 
itational waves) the rate of inspiral is fixed by the flux C, which is therefore 
a crucial quantity to predict. Excitingly enough, we know that C should be 
predicted to 3PN order for detection and analysis of inspiralling binaries in 
future experiments [95,96]. 

To 3PN order we can use the relation (56) giving the 3PN radiative 
quadrupole moment. Here we concentrate our attention on tails and tails of 
tails. The dominant tail contribution at 1.5PN order yields correspondingly 
a contribution in the total flux (with U = T — R/c): 
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(58) 



Since we are interested in the dominant tail we have replaced using (46) 
the canonical mass quadrupole by the source quadrupole. Similarly there are 
some tail contributions due to the mass octupole, current quadrupole and all 
higher-order multipoles, but these are correlatively of higher post-Newtonian 
order [see the factors 1/c in (57)]. It has been shown [68] that the work done 
by the dominant “hereditary” contribution in the radiation reaction force 
within the source - which arises at 4PN order in the equations of motion - 
agrees exactly with (58). 

Next, because C is made of squares of (derivatives of) radiative moments, 
it contains a term with the square of the tail integral at 1.5PN. This term 
arises at the 3PN relative order and reads 
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Finally, there is also the direct 3PN contribution of tails of tails in (56): 
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(60) 
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By a control of all the hereditary integrals in C up to 3PN we have checked 
[71] that the terms (59)-(60) do exist. The two coirtributions (59) and (60) 
appear somewhat oir the same footing - of course both should be taken iirto 
accouirt in practical computations. Note that in a physical situatioir where the 
emission of radiation stops after a certain date, in the sense that the source 
multipole moments become constant after this date (assuming a consistent 
matter model which would do this at a given post-Newtonian order), the 
only contribution to C which survives after the end of emission is the 3PN 
tail-square contribution (59). 

6 Post-Newtonian Approximation 

In Sects. 2 and 3 we have reasoned upon the formal post-Newtonian expan- 
sion of the near-zone field to obtain the source multipole moments as 
functionals of the post-Newtonian pseudo-tensor . We have also consid- 
ered in Sects. 4 and 5 the formal expansion c — >■ oo of the radiation field 
wheir holdiirg the multipole moments fixed. Clearly missing in this scheme is 
an explicit algorithm for the computatioir of iir the irear zoire. No such 
algorithm (say, in the spirit of the post-Miirkowskiair algorithm iir Sect. 4) is 
known presently, but a lot is knowir oir the first few post-Newtoniair iterations 
[20,21,44-58,30,24]. 

The maiir difficulty in settiirg up a post-Newtonian algorithm is the ap- 
pearance at some post-Newtonian order of divergent Poisson- like integrals. 
This comes from the fact that the post-Newtonian expansion is actually a 
near-zone expansion [44], which is valid only in the region where r = 0{X/c), 
and that such an expansion blows up when taking formally the limit r — ^ -|-oo. 
For instance, Rendall [13] has shown that the post-Newtonian solution can- 
irot be asymptotically flat starting at the 2PN or 3PN level, depeirding on the 
gauge. This is clear from the structure of the exterior near-zone expansion 
(15), which involves many positive powers of the radial distance r. Thus, one 
is not allowed in geireral to consider the limit r — ^ -|-oo. Iir consequence, using 
the Poisson integral for solving a Poisson equation with non-compact-support 
source at a given post-Newtonian order is a priori meaningless. Indeed the 
Poisson integral not only extends over the near-zone but also over the regions 
at infinity. This means that the Poisson integral does not constitute the cor- 
rect solution of the Poisson equation in this context. However, to the lowest 
post-Newtonian orders it works; for instance it was shown by Kerlick [50,51] 
and Caporali [52] that the post-Newtonian iteration (including the sugges- 
tion by Fillers [48,49] of an improvement with respect to previous work [55]) 
is well-defined up to the 2.5PN order where radiation reaction terms appear, 
but that some divergent integrals show up at the 3PN order. 

Another difficulty is that the post-Newtonian approximation is in a sense 
not self-supporting, because it necessitates information coming from outside 
its own domain of validity. Of course we have in mind the boundary condi- 
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tions at infinity which determine the radiation reaction in the source’s local 
equations of motion. Again, to the lowest post-Newtonian orders one can 
circumvent this difficulty by considering retarded integrals that are formally 
expanded when c — ?► oo as series of “instantaneous” Poisson-like integrals 
[55]. However, this procedure becomes incorrect at the 4PN order, not to 
mention the problem of divergencies, because the near-zone field (as well as 
the source’s dynamics) ceases to be given by an instantaneous functional of 
the source parameters, due to the appearance of “tail-transported” hereditary 
integrals modifying the lowest-order radiation reaction damping [68,32]. 

Let us advocate here that the cure of the latter difficulty (and perhaps 
of all difficulties) is the matching equation (16). Indeed suppose that one 
knows a particular solution of the Poisson equation at some post-Newtonian 
order. This solution might be in the form of some “finite part” of a Pois- 
son integral. The correct post-Newtonian solution will be the sum of this 
particular solution and of a homogeneous solution satisfying the Laplace 
equation, namely a harmonic solution, regular at the origin, which can al- 
ways be written in the form for some unknown constant tensors 

Al. The homogeneous solution is associated with radiation reaction effects. 
Now the matching equation states that the multipole expansion of the post- 
Newtonian solution agrees with the near-zone expansion of the exterior field 
(which has been computed beforehand in Sect. 4). The multipole expansion 
of the known particular solution can be obtained by a standard method, 
and the multipole expansion of the homogeneous solution is simply itself, i.e. 
M(Y ^lXl) = Y^lXl- Therefore, we see that the matching equation de- 
termines in principle the homogeneous solution (i.e. all the unknown tensors 
Ai), and since the exterior field satisfies relevant boundary conditions at in- 
finity, the Ai’s should correspond to the radiation reaction on a truly isolated 
system. See [67,68,31,32] for implementation of this method to determine the 
radiation reaction force to 4PN order (1.5PN relative order). 

6.1 The Inner Metric to 2.5PN Order 

Going to high post-Newtonian orders can become prohibitive because of the 
rapid proliferation of terms. Typically any allowed term (compatible dimen- 
sion, correct index structure) does appear with a definite non-zero coefficient 
in front. However, high post-Newtonian orders can be manageable if one 
chooses some appropriate matter variables, and if one avoids expanding sys- 
tematically the retardations due to the speed of propagation of gravity. Often 
it is sufficient, and clearer, to present a result in terms of matter variables still 
containing some c’s, and perhaps also in terms of some convenient retarded 
potentials (being clear that any retardation going to an order higher than 
the prescribed post-Newtonian order of the calculation is irrelevant). See for 
instance (65) and (68)-(69) below. Anyway, only in a final stage, when a re- 
sult to the prescribed order is in hands, should we introduce the more basic 
matter variables (e.g. the coordinate mass density) and perform all necessary 
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retardations. Then of course one does not escape to a profusion of terms, but 
at least we have been able to carry the post-Newtonian iteration using some 
reasonably simple expressions. 

The matter variables are chosen [30,24] in a way consistent with our earlier 
definitions (35), i.e. 



^00 _|_ 'Y'ii 

a = ^ ; (61a) 

ai = ; (61b) 

c 

. (61c) 



To 2.5PN order one defines some retarded potentials V, Vi, Wij, X and Ri, 
with V and Vi looking like some retarded versions of the Newtonian and 
gravitomagnetic potentials, and Wij being associated with the matter and 
gravitational- field stresses: 



V = {— 47rGcr} , 


(62a) 


V = {-47rGcrJ , 


(62b) 


Wij = {-47rG(cr^ - Sijakk) ~ diVdjV} , 


(62c) 


4 = < 


'^-WG{Va, - Va) - 2duVdWk ~ ^dtVdwj , 


(62d) 


><> 

III 
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-47rGPcr,, -k 2VidtdiV + Vd\V 






+ ^{dtV)^ - 2dWjdjV + Wjdf^V^ , 


(62e) 



where denotes the retarded integral (11). All these potentials but V and 
Vi have a spatially non-compact support. The highest non-linearity entering 
them is cubic; it appears in the last term of X. 

Based on the latter potentials one can show [24,74] that the inner metric 
to order 2.5PN (in harmonic coordinates, d,y{y/—gg^'^) = 0) takes the form 
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500 = -1 + - ^V^ + 
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(63a) 

(63b) 

(63c) 



(writing g^,^ would be more consistent with the notation of Sect. 2). With 
this form, we believe, the computational problems encountered in applica- 
tions are conveniently divided into the specific problems associated with the 
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computation of the various potentials (62), which constitute in this approach 
some appropriate computational “blocks” (having of course no physical sig- 
nification separately) . By expanding all powers of 1 /c present into the mat- 
ter densities (61) and into the retardations of the potentials (62), we find 
that the metric (63) becomes extremely complicated, as it really is (see e.g. 
[46,47,50,51]). 

Because of our use of retarded potentials, the metric (63) involves explic- 
itly only even post-Newtonian terms (using the post-Newtonian terminology 
that even terms correspond to even powers of 1/c in the equations of mo- 
tion) . We have checked [24] that the odd post-Newtonian terms (responsible 
for radiation reaction), contained in (63) via the expansion of retardations, 
match, in the sense of the equation (16), to the exterior metric satisfying the 
no-incoming radiation condition (9). 

The harmonic gauge condition implies some differential equations to be 
satisfied by the previous potentials. To 2.5PN order we find 
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, (64a) 
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where Wu = SijWij. These equations are in turn equivalent to the equation 
of continuity and the equation of motion for the matter system. 



dtcr + d^(Ji 



dt<Ji + djaij 



^ {dtCTii - adtV) + O , 

aar + o(^l[) . 



(65a) 

(65b) 



Note that the precision is IPN for the equation of continuity but only New- 
tonian for the equation of motion. 



6.2 The Mass- Type Source Moment to 2.5PN Order 

From the 2.5PN metric (63) we obtain the pseudo-tensor r and the auxiliary 
quantities (35), that we replace into the formulas (36) to obtain the 2.5PN 
source multipole moments. Recall that the ^-integration in the moments is 
to be carried out using the formula (27). Let us first see how this works at 
the IPN order. 

We need S to IPN order and Si to Newtonian order. The latter quantity 
reduces to the matter part, St = ai + 0{l/c?), and the former one reads after 
a simple transformation 
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The substitution into the moments II given by (36a) leads to 



II = FPs=o 



+ 



^ (fix |x/ro|^|xiCr- 

|xpXL 2 4(2/ -I- l)xiL „ 

2c2(2/ + 3) c2(/ + l)(2/ + 3) 




(67) 



The integrand is non-compact-supported because of the contribution of the 
second term, and accordingly we keep the regularization factor |x/ro|^ and 
finite part operation. But let us operate by parts the second term, using the 
fact that |x|'®XL/i(P^) - A{\x\^xl)V^ = di{\x\^XLdi{V‘^) - di{\x\^ xl)V‘^} 
is a pure divergence. When the real part of i? is a large negative number, 
we see thanks to the Gauss theorem that the latter divergence will not 
contribute to the moment, therefore by the unicity of the analytic contin- 
uation it will always yield zero contribution. Thus, using Axi^ = 0, we 
can replace \x\^ xlA{V'^) in the second term of (67) by Z\(|x|^5;l)17^ = 
B{B + I + l)|x|^“ 2 ^^y 2 ^ because of the explicit factor B we see that 
the second term can be non-zero only in the case where the factor B multi- 
plies an integral owning a simple pole ^ 1 j B due to the integration bound 
|x| — >■ oo. Expressing 1/2 Newtonian order) in terms of source points Zi 
and Z 2 , we obtain the integral / (fix |x|^“2fj^|x — zi|“^|x — Z 2 |“^. When 
|x| — >• oo each |x — zi^ 2 |~^ can be expanded as a series of hij 3 |x|“*c 2 -i- 
performing the angular integration shows that the sum of “multipolarities” 
I + I 1 + I 2 is necessarily an even integer. When this is realized the remaining 
radial integral reads / d\x\ |x|^+*“b-b -2 develops a pole only when 

I — li — I 2 — 2 = —1. But that is incompatible with the previous finding. 
Thus the second term in (67) is identically zero, and we end up simply with 
a compact-support expression on which we no longer need to implement the 
finite part. 



II 




XLO-+ 



|xpX_L „2 4(2/ -I- l)xiL „ 

2c2(2/ + 3) c2(/ + l)(2/ + 3) 




■ ( 68 ) 



This expression was first obtained in [30] using a different method valid at 
IPN order. Here we have recovered the same expression from the formula 
(36a) valid to any post-Newtonian order [24,25]. 

Only starting at the 2PN order does the mass multipole moment have a 
non-compact support (so the finite part becomes crucial at this order). By a 
detailed computation in [24] we arrive at the following 2PN (or rather 2.5PN) 
expression: 
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hit) = 




Recall that the canonical moment Ml differs from the source moment II 
precisely the 2.5PN order [see (46)]. 

7 Point-Particles 

So far the post-Newtonian formalism has been developed for smooth (i.e. C°°) 
matter distributions. As such, the source multipole moments (36) become ill- 
defined in the presence of singularities. We now argue that the formalism is 
in fact also applicable to singular sources (notably point-particles described 
by Dirac measures) provided that we add to our other basic assumptions a 
certain method for removing the infinite self-field of point-masses. Our main 
motivation is the inspiralling compact binary - a system of two compact 
objects (neutron stars or black holes) which can be described with great 
precision by two point-particles moving on a circular orbit, and whose orbital 
phase evolution should be computed prior to gravitational-wave detection 
with relative 3PN precision [95,96]. 

For this application we restrict ourselves to two point-masses mi and 
7712 (constant Schwarzschild masses). The trajectories are yi(t) and y 2 (t) 
and the coordinate velocities vi _2 = dyi, 2 /dt] we pose Vi 2 = (c, vi_ 2 )- The 
symbol 1 o 2 means the same term but with the labels of the two particles 
exchanged. A model for the stress-energy tensor of point-masses (say, at 2PN 
order) is 




(x,t) =/ii(t)?;]'(tX(t)(5[x-yi(f)]-hlo2 ; 



(70a) 

(70b) 



^i(t) = 
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where S denotes the three-dimensional Dirac measure, and the metric co- 
efficients in harmonic coordinates (g = det^^j/). The notation (ggfi,u)i means 
the value at the location of particle 1. However, due to the presence of the 
Dirac measure at particles 1 and 2, the metric coefficients will be singular 
at 1 and 2. Therefore, we must supplement the model (70) by a method of 
“regularization” able to give a sense to the ill-defined limit at 1 or 2. H pri- 
ori the choice of one or another regularization constitutes a fully-qualified 
element of the model of point-particles. In the following we systematically 
employ the Hadamard regularization, based on the Hadamard “partie finie” 
of a divergent integral [97,98] . 

Let us discuss an example. The “Newtonian” potential U, defined by 
U = 47tGct), where a is given by (61a) [we have V = U 0(l/c^)j, 

follows from (70a) as 



U = 



Gpi 

ri 




+ 1 O 2 , 



(71) 



where ri = |x — yi|. To Newtonian order U = Gmi/ri-|-0(l/c^)-|-l O 2. We 
compute 17 at the IPN order: from (70b) we deduce at this order pi/rm = 
1 — (C/)i/c^ -I- v\j2(? 7 0(l/c'^), which involves V itself taken at point 1, but 
of course this does not make sense because U is singular at 1 and 2. Now, 
after applying the Hadamard regularization (described below), we obtain 
unambiguously the standard Newtonian result {U)i = Grfi 2 lr\i + 0(l/c^), 
where ri 2 = |yi — y 2 |, that we insert back into pi. So, U at IPN, and its 
regularized value at 1, read 



U = 




{U)i 




Gm2 

ri2 

Gm\ 

ri2 



2 \ 

3 2' 
+ 2^2 




+ 1 O 2 , 



(72a) 

(72b) 



7.1 Hadamard Partie Finie Regularization 

We consider the class of functions of the field point x which are smooth 
on IR^ except at the location of the two source points yi, 2 , around which 
the functions admit some power-like expansions in the radial distance ri = 
|x — yi|, with fixed spatial direction Ui = (x — yi)/ri (and idem for 2). Thus, 
for any F(x) in this class, we have 



^ = E^?A(a) (ni) 

a 


(when ri — >• 0) ; 


(73a) 


^=E^2/2(a)(n2) 


(when r 2 — >■ 0) , 


(73b) 



a 
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where the summation index a ranges over values in Z bounded from below, 
a > —at) (we do not need to be more specific), and where the coefficients of the 
various powers of ri _2 depend on the spatial directions ni^ 2 - In (73) we do not 
write the remainders for the expansions because we don’t need them; simply, 
we regard the expansions (73) as listings of the various coefficients /i(„) and 
/ 2 (a)- We assume also that the functions F in this class decrease sufficiently 
rapidly when |x| — >• 00 , so that all integrals we consider are convergent at 
infinity. 

The integral f d^xF is in general divergent because of the singular be- 
haviour of F near yi, 2 , but we can compute its partie finie (Pf) in the sense 
of Hadamard [97,98] . Let us consider two volumes surrounding the two sin- 
gularities, of the form ri < spi(ni) (and similarly for 2), where s measures 
the size of the volume and pi gives its shape as a function of the direction 
111 (pi = 1 in the case of a spherical ball). Using (73) it is easy to deter- 
mine the expansion when s — >■ 0 of the integral extending on JR^ deprived 
from the two previous volumes, and then to subtract from the integral all the 
divergent terms when s — >■ 0 in the latter expansion. The Hadamard partie 
finie is defined to be the limit when s — >■ 0 of what remains. As it turns out, 
the result can be advantageously re-expressed in terms of an integral on JR^ 
deprived from two spherical balls (pi ,2 = 1), at the price of introducing two 
constants Si 2 which depend on the shape of the two regularizing volumes 
originally considered. With full generality the Hadamard partie finie of the 
divergent integral reads 



Pf 




a+3<-l 
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d^x F 



df^ifi(a) + In 




where Si is given by 



+ 1 




(74) 



In Si = 



/df2i/i(_3) In Pi 



(75) 



/ df^i/i(-3) 

Because of the two arbitrary constants si ^2 the Hadamard partie finie is am- 
biguous, and one could think a priori that there is no point about defining a 
divergent integral by means of an ambiguous expression. Actually the point 
is that we control the origin of these constants: they come from the coeffi- 
cients of l/ri 2 in the expansions of F, which generate logarithmic terms in 
the integral. As we shall see the constants si ,2 do not appear in the post- 
Newtonian metric up to the 2.5PN order (they are expected to appear only 
at 3PN order). 

We can also give a meaning to the value of the function F at the location 
of particle 1 for instance, by taking the average over all directions rii of the 
coefficient of the zeroth power of ri in (73a), namely 
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(^)i = j -^Mo) ■ 

We refer also to the definition (76) as the Hadamard partie finie (of the 
function F at 1) because this definition is closely related to the definition 
(74) of the Hadamard partie finie of a divergent integral. To see this, apply 
(74) to the case where the function F is actually a gradient, F = diG, where 
G satisfies (73) [it is then clear that F itself satisfies (73)]. We find 

Pf J (fix diG = —4Tr{n\rlG)i — 4Tr{n2rlG)2 (77) 

where in the right side the values at 1 and 2 are taken in the sense of the 
Hadamard partie finie (76). This nice connection between the Hadamard 
partie finie of a divergent integral and that of a singular function is clearly 
understood from applying the Gauss theorem on two surfaces ri 2 = s sur- 
rounding the singularities (there is no dependence on the constants 51 , 2 )- 

7.2 Multipole Moments of Point-Mass Binaries 

To compute the source moments (36) of two point-particles we insert (70) in 
place of the stress-energy tensor of a continuous source, and we pick up 
the Hadamard partie finie [in the sense of (74)] of all integrals. This ansatz 
reads 



(.^L)point-mass = Pf |.^L[Tpoint-mass]| i (78a) 

(TL)point-mass = Pf |</L[Tpoint-mass]} ' (78b) 

As we have seen in (69), the source multipole moments involve at high 
PN order many (non-compact-support) non-linear contributions which can be 
expressed in terms of retarded potentials such as V. The paradigm of such 
non-linear contributions is a term involving the quadratic product of two 
(derivatives of) potentials V, say dVdV, or, neglecting 0(l/c^) corrections, 
dUdU . To Newtonian order U is given by Gmxjrx F Gto 2/?'2 and it is easily 
checked that this paradigmatic term can be written as a certain derivative 
operator, say 99, acting on the elementary integral (assuming for simplicity 
I = 2 ) 



Yij(yx,y2) = ~^FFb=o f |x/ro|^^^ . (79) 

27t j rxT2 

We see that the integral would be divergent at infinity without the finite part 
operation. However, it is perfectly well-behaved near 1 and 2 where there is 
no need of a regularization. The integral (79) can be evaluated in various 
ways; the net result is [24,72] 
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r^J- 3 



Vi 



<ij> 









(80) 



where < ij >= STF(ij). Starting at 3PN order we meet some elementary 
integrals which need the regularization at 1 or 2 in addition to involving the 
finite part at infinity. An example is 



^d(yi) = J d'x |x/ro|«^| . (81) 

To obtain this integral one splits it into a near-zone integral extending over 
the domain ri < TZ\ (say), and a far-zone integral extending over TZ\ < ri. 
The Hadamard regularization at 1 applies only to the near-zone integral, 
while the finite part at B = 0 is needed only for the far-zone integral. The 
result, found to be independent of the radius TZi, reads [77] 



Zij — 




16 

l5 



<ij> 

yi 



(82) 



In this case we find an explicit dependence on both the constants tq due to 
the finite part at infinity, and si due to the Hadamard partie finie near 1 [see 
(74)]. However these constants do not enter the multipole moments before 
the 3PN order (collaboration with Iyer and Joguet [77]). 

A long computation, done in [72], yields the mass-type quadrupole mo- 
ment at the 2PN order fully reduced in the case of two point-masses moving 
on a circular orbit. The method is to start from (69) (issued from [24]) and to 
employ notably the elementary integral (79)-(80) (see also [72] for the treat- 
ment of a cubically non-linear term). An equivalent result has been obtained 
by Will and Wiseman using their formalism [75]. In a mass-centered frame 
the moment is of the form 



hj - + B + O , (83) 

where yi = y\ — 2/2 ~ '*^21 where uj denotes the binary’s Newtonian 

orbital frequency [w^ = Gm/ri 2 with m = mi-|-TO 2 ], and where = mim 2 /rn 
is the reduced mass. The point is to obtain the coefficients A and B developed 
to 2PN order in terms of the post-Newtonian parameter 7 = Gm/ri 2 C^ , 
where we recall that ri 2 is the distance between the two particles in harmonic 
coordinates. Untill 2PN we find some definite polynomials in the mass ratio 
12 = H I'm (such that 0 < v < 1/4): 



A = I -I- 7 



B = 




1 

42 “ 
11 





1607 

" 37 ^ 



461 18395^ 241 , 

1512 ~ 1512 1512^^ 

1681 229 n' 

V -\ . 

378 378 



(84a) 



(84b) 
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The 2PN mass quadrupole moment (83)-(84) is part of a program aiming 
at computing the orbital phase evolution of inspiralling compact binaries to 
high post-Newtonian order (see Sect. 7.4). First-order black-hole perturba- 
tions, valid in the test-mass limit z/ — )► 0 for one body, have already achieved 
the very high 5.5PN order [87,99-101]. Recovering the result of black-hole 
perturbations in this limit constitutes an important check of the overall for- 
malism. For the moment it passed the check to 2.5PN order [72,73]; this is 
quite satisfactory regarding the many differences between the present ap- 
proach and the black-hole perturbation method. 

7.3 Equations of Motion of Compact Binaries 

The equations of motion of two point-masses play a crucial role in accounting 
for the observed dynamics of the binary pulsar PSR1913-I-16 [1-3,22], and 
constitute an important part of the program concerning inspiralling compact 
binaries. The motivation for investigating rigorously the equations of motion 
came in part from the salubrious criticizing remarks of Jiirgen Fillers et al 
[7]. Four different approaches have succeeded in obtaining the equations of 
motion of point-mass binaries complete up to the 2.5PN order (dominant 
order of radiation reaction): the “post-Minkowskian” approach of Damour, 
Deruelle and colleagues [16-19]; the “Hamiltonian” approach of Schafer and 
predecessors [102,103,57,58] ; the “extended-body” approach of Kopejkin et al 
[104,105]; and the “post-Newtonian” approach of Blanchet, Faye and Ponsot 
[74]. The four approaches yield mutually agreeing results. 

The post-Newtonian approach [74] consists of (i) inserting the point-mass 
stress-energy tensor (70) into the 2.5PN metric in harmonic coordinates given 
by (63); (ii) curing systematically the self-field divergences of point-masses us- 
ing the Hadamard regularization; and (iii) substituting the regularized metric 
into the standard geodesic equations. For convenience we write the geodesic 
equation of the particle 1 in the Newtonian-like form 

dt 1 

where the (specific) linear momentum V\ and force T{ are given by 

Crucial in this method, the quantities are evaluated at the location of particle 
1 according to the rule (76). All the potentials (62) and their gradients are 
evaluated in a way similar to our computation of U in (72), and then inserted 
into (85)-(86). We “order-reduce” the result, i.e. we replace each acceleration, 
consistently with the approximation, by its equivalent in terms of the posi- 
tions and velocities as given by the (lower-order) equations of motion. After 
simplication we find, in agreement with other methods. 



(85) 



( 86 ) 
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dv\ 

dt 



GniD 



' 12 



^12 [4(ni2Wl) - i{ni2V2)\ 
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(87) 



[where n \2 = {y\ — V 2 )Iti 2 \ v\^ = tiI — nl; and e.g. ( 7712711 ) denotes the Eu- 
clidean scalar product] . At the IPN or 1 j(? level the equations were obtained 
before by Lorentz an Droste [20], and by Einstein, Infeld and Hoffmann [21]. 
The 2.5PN or 1/c® term represents the radiation damping in harmonic co- 
ordinates [correct because the metric (63) we started with matches to the 
post-Minkowskian exterior field]. In the case of circular orbits, the equations 
simplify drastically: 



dv\2 

dt 



■^2PN2/12 



32G^ 



m^v 






O 




where the orbital frequency 012 pn of the 2PN circular motion reads 



( 88 ) 
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2 Gm 

‘^2PN = 

'12 L 



1 + (-3 -I- p)7 -I- ( 6 -I- + v‘^ 



(89) 



(the post-Newtonian parameter is 7 = Grtij (?ry 2 ,\ and v = /r/m). 



7.4 Gravitational Waveforms of Inspiralling Compact Binaries 

The gravitational radiation field and associated energy flux are given by (52) 
and (57) in terms of time-derivatives of the radiative miiltipole moments, 
themselves related to the source multipole moments by formulas such as (56). 
Furthermore, at a given post-Newtonian order, the source moments admit 
some explicit though complicated expressions such as (68)-(69), which, when 
specialized to (non-spinning) point-mass circular binaries, yield e.g. (83)-(84). 

Now, for insertion into the radiation field and energy flux, one must com- 
pute the time-derivatives of the binary moments, with appropriate order- 
reduction using the binary’s equations of motion (87)- (89). This yields in 
particular the fully reduced (up to the prescribed post-Newtonian order) 
gravitational waveform of the binary, or more precisely the two independent 
“plus” and “cross” polarization states and h^- The result to 2PN order 
is written in the form 



(90) 

where, for convenience, we have introduced a post-Newtonian parameter 
which is directly related to the orbital frequency: x = {Gmu} 2 PN/c^)^^^, where 
W2PN is given for circular orbits by (89). The various post-Newtonian coeffi- 
cients in (90) depend on the cosine and sine of the “inclination” angle between 
the detector’s direction and the normal to the orbital plane (ci = cos i and 
Si = sini), and on the masses through the ratios v = /i/m and 5m/m, where 
Sm = mi — m 2 - The result for the “plus” polarization (collaboration with 
Iyer, Will and Wiseman [106]) is 



^+,x — 



2Gmvx 

c^R 






774° = —(1 -I- c^) cos2i/) 



H 



( 1 / 2 ) _ Si 



6m 



+ 



8 m 



(5 -I- Ci) cos 1/ — 9(1 -I- c^) cos 



(91a) 

(91b) 



i7« = 



19 -f 9cf - 2ct - z/(19 - 11c/ - 6c/) 



cos 2ip 



-3s/(l + c/)(l-3ii) cos4i/ 



H 



( 3 / 2 ) 



Si Sm 
192 m 



57 -f 60c/ - c/ - 2 j/( 49 - 12c/ - c/) 



cos 4’ 



(91c) 
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73 + 40c? - 9c? - 2v{25 - 8c? - 9c?) 



cos 3i/i 



27 
625 

+ -— (1 — 2j/)s?(1 + c?) cos — 27t(1 + c?) cos 2ip , 



(91d) 






1 



22 + 396c? + 145c? - 5c? + -i^(706 - 216c? - 251c? + 15c?) 

o 



-51^2(98 - 108c? + 7c? + 5c?) 



cos 2tj} 



+ 15 ^* 



59 + 35c? - 8c? - -1^(131 + 59c? - 24c?) 

O 



+5i/^(21-3c?-8c?) 



cos 4'0 



81 



- ^ ( 1 - s? ( 1 + c? ) cos 6 V' 



Si 5m ( 
40 m \ 

-27 



ll + 7c? + 10(5 + c?)ln2 



sin ij; — 57t( 5 + c?) cos if) 



7-10 ln(3/2) 



(1 + c?) sin3'!/' + 1357 t( 1 + c?) cos3^/’ > . 



(916) 

The “cross” polarization admits a similar expression (see [106]). Here, ip 
denotes a particular phase variable, related to the actual binary’s orbital 
phase 4> and frequency to = u> 2 pn by 



, , 2Gmoj , f \ 

il) = (j) ^ ( ^ ) > ( 92 ) 

c'’’ V^o/ 

(j) is the angle, oriented in the sense of the motion, between the vector sepa- 
ration of the two bodies and a fixed direction in the orbital plane (since the 
bodies are not spinning, the orbital motion takes place in a plane). In (92), 
Wo denotes some constant frequency, for instance the orbital frequency when 
the signal enters the detector’s frequency bandwidth; see [106] for discussion. 

The previous formulas give the waveform of point-mass binaries whenever 
the frequency and phase of the orbital motion take the values uj and (/). To get 
the waveform as a function of time, we must replace w and </> by their explicit 
time evolutions oj{t) and <(>(t). Actually, the frequency is the time-derivative 
of the phase: w = d(j>/dt. The evolution of the phase is entirely determined, 
for circular orbits, by the energy balance equation dE/dt = —C relating the 
binding energy E of the binary in the center of mass to the emitted energy 
flux C. E is computed using the equations of motion (87), and C follows 
from (57) and application of the previous formalism [changing the radiative 
moments to the source moments, applying (83)-(84), etc...]; the net result for 
the 2.5PN orbital phase [72,75,73] is 
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4> = 4>q- 



V 



/ 9275495 284875 _ 1855 

V 14450688 258048 ^ 20^ 

/ 38645 15 ^ 

172032 - 2048"j 









+ 



(93) 



where 4>o is ^ constant phase (determined for instance when the frequency is 
Wo), and 0 the convenient dimensionless time variable 

0=^(ic-i), (94) 

tc being the instant of coalescence at which, formally, u>{t) tends to infinity (of 
course, the post-Newtonian method breaks down before the final coalescence). 
All the results are in agreement, in the limit i/ ^ 0, with those of black-hole 
perturbation theory [87,99-101]. 



8 Conclusion 

The formalism reviewed in this article permits investigating in principle all 
aspects of the problem of dynamics and gravitational-wave emission of a 
slowly-moving isolated system (with, say, vjc ^ 0.3 at most): the generation 
of waves, their propagation in vacuum, the back-reaction onto the system, the 
structure of the asymptotic field, and most importantly the relation between 
the far-held and the source parameters. Of course, the formalism is merely 
post-Newtonian and never “exact”, but in applications to astrophysical ob- 
jects such as inspiralling compact binaries this should be sufficient provided 
that the post-Newtonian approximation is carried to high order. 

Furthermore, there are several places in the formalism where some results 
are valid formally to any order of approximation. For instance, the source mul- 
tipole moments are related to the infinite formal post-Newtonian expansion 
of the pseudo-tensor [see (18) or (36)], and the post-Minkowskian iteration 
of the exterior held is performed to any non-linear order [see (43)]. In such 
a situation, where an inhnite approximate series can be dehned, there is the 
interesting question of its relation to a corresponding element in the exact 
theory. For the moment the only solid work concerns the post-Minkowskian 
approximation of the exterior vacuum held, which has been proved to be 
asymptotic [26]. Likewise it is plausible that the expressions of the source 
multipole moments could be valid in the case of exact solutions. 

The most important part of the formalism where a general prescription 
for how to proceed at any approximate step is missing, is the post-Newtonian 
expansion for the held inside the isolated system. For instance, though the 
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miiltipole moments are given in terms of the formal post-Newtonian expan- 
sion of the pseudo-tensor, no general algorithm for computing explicitly this 
post-Newtonian expansion is known. An interesting task would be to define 
such an algorithm, in a manner similar to the post-Minkowskian algorithm 
in Sect. 4. In the author’s opinion, the post-Newtonian algorithm should be 
defined conjointly with the post-Minkowskian algorithm, and should rely on 
the matching equation (16), so as to convey into the post-Newtonian field 
the information about the exterior metric. 

Note that even if a general method for implementing a complete approx- 
imation series is defined, this method may be unworkable in practical cal- 
culations, because not explicit enough. For instance the post-Minkowskian 
series (43) is defined in terms of “iterated” retarded integrals, but needs to 
be suplemented by some formulas, to be used in applications, for the retarded 
integral of a multipolar extended source. In this respect it would be desirable 
to develop the formulas generalizing (50)-(51) to any non-linear order. This 
should permit in particular the study of the general structure of tails, tails 
of tails, and so on. 

For the moment the only application of the formalism concerns the ra- 
diation and motion of point-particle binaries. Of course it is important to 
keep the formalism as general as possible, and not to restrict oneself to a 
particular type of source, but this application to point-particles offers some 
interesting questions. Indeed, it seems that the post-Newtonian approxima- 
tion used conjointly with a regularization a la Hadamard works well, and that 
one is getting closer and closer to an exact (numerical) solution correspond- 
ing to the dynamics and radiation of two black-holes. So, in which sense does 
the post-Newtonian solution (corresponding to point-masses without hori- 
zons) approach a true solution for black-holes? Does the adopted method of 
regularizing the self-field play a crucial role? Is it possible to define a regu- 
larization consistently with the post-Newtonian approximation to all orders? 
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1 Introduction 

Duality is presently considered the key to the Holy Grail of String Theory ~ 
it is supposed to provide links between the five known different superstring 
theories in ten dimensions, hoped to be just different limits of one unique 
eleven dimensional theory [1]. The main role of duality is to relate two differ- 
ent regimes - e.g. one of weak and one of strong coupling - of these theories. 
In most cases duality is not a very precise concept, just because the strong 
coupling regime is a matter of speculation. This is rather different from the 
duality transformations in the gravitational theories considered in this work, 
which have a very precise meaning - not the least because we are dealing with 
classical field theories (compare, however, [2] and for a modest attempt to 
use duality symmetry in Quantum Gravity, see [3]). The historical example of 
all these dualities is the duality between electric and magnetic fields in elec- 
trodynamics, which, when expressed in terms of the field strength, is just an 
example of the mathematical notion of Hodge duality for differential forms. 
Actually, the source-free Maxwell equations are not only invariant under a 
discrete duality, but under a continous one-parameter group of duality rota- 
tions. It is this kind of transformations, which is the subject of this article. 
Whereas, in general, the electromagnetic duality rotations are an ‘on-shell’ 
symmetry, i.e. a symmetry of the equations of motion and not of the action, 
the situation changes, if one considers time-independent solutions. In this 
case also the magnetic field can be derived from a (pseudo) scalar potential 
and the duality rotations expressed in terms of scalar potentials become a 
bona fide ‘off-shell’ symmetry of the ‘dimensionally reduced’ three dimen- 
sional theory. This replacement of the vector potential by a scalar one has 
analogues in higher dimensions playing an important role in the construction 
of supergravity theories through the process of dimensional reduction. A typ- 
ical example is the (pseudo) scalar ‘axion’, obtained as the dual of a gauge 
field 2-form in 4 dimensions. This scalar axion combines nicely with another 
scalar, the dilaton to a doublet giving rise to an SL{2) group of non-linear 
duality transformations [4]. A particular element of this group, replacing the 
dilaton by its inverse, lies at the heart of string duality (‘S-duality’)[5], where 
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the expectation value of the dilaton plays the role of a string coupling con- 
stant. 

Descending from four to three dimensions the group of continous electro- 
magnetic duality transformations is extended from the compact group SO(2) 
to the full non-compact Euclidean group in two dimensions. The reason for 
this enhancement is simply the possibility to shift the potentials by a constant 
without changing the field strengths. 

Let us now turn to (vacuum) gravity, also restricting ourselves to station- 
ary (time-independent) solutions and introduce the ‘twist potential’, playing 
a similar role as the magnetic potential in electrodynamics. It was Ehlers 
[6], who first observed that there is a discrete duality symmetry, relating the 
twist potential and the tt-component of the metric (the latter playing a sim- 
ilar role as the electric potential). Later it was recognized [7] that there is a 
whole SL(2) group acting on this doublet of potentials transforming station- 
ary solutions of Einstein’s equations into each other and that the potentials 
themselves can be interpreted as coordinates of a 2-dimensional ‘potential 
space’ with the geometry of the coset space SL(2)/SO(2). In fact there is a 
strong similarity between electric charge and mass and magnetic charge and 
the so-called NUT charge on the other hand. 

This analogy becomes even closer in the Einstein-Maxwell theory. 
Combining the two scalar potentials of gravity with those of the sta- 
tionary EM field, one may use them to parametrize the coset space 
517(2, 1)/S(U(1, 1) X U(l)) equipped with a natural action of the non- 
compact group SC/(2,1) [8]. This group contains ‘Harrison’ transformations 
[9] transforming mass into charge, e.g. the Schwarzschild solution into the 
Reissner-Nordstro m solution. 

The appearance of a non-compact group of ‘hidden’ duality transforma- 
tions like 517(2, 1) is a rather general phenomenon in the process of ‘Kaluza- 
Klein’ reduction of the maximal eleven dimensional supergravity theory [10]. 
Many different ‘hidden’ duality symmetries were found in various dimensions 
and under various truncations, all having in common the peculiar property 
that the related coset spaces parametrized by the scalars are non-compact 
Cartan Symmetric Spaces [11]. The underlying reason for this fact is still 
unknown. In the paper [12] with Breitenlohner and Gibbons we were able to 
determine a large class of four-dimensional theories containing abelian vector 
fields and scalars besides the gravitational field yielding suitable symmet- 
ric spaces upon reduction to three dimensions. Such field theories for scalar 
fields taking their values in a Riemannian space are familiar in Quantum 
Field Theory as non-linear cr-models. 

In view of the situation in the Maxwell theory one may wonder, whether 
it is possible to lift the off-shell symmetries to corresponding on-shell sym- 
metries in four (or even higher) dimensions. An attempt in this direction for 
N = 8 supergravity has been made by Nicolai and de-Wit [13]. But let us 
proceed in the opposite direction. Even more remarkable is the enhancement 
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of ‘hidden’ duality symmetries, when the process of dimensional reduction 
is performed one step further leading to an effective 2-dimensional theory. 
The simplest example is provided by solutions of vacuum gravity with a two- 
dimensional abelian isometry group, i.e. with two commuting Killing vector 
fields. It was Geroch [14] who observed that each given stationary, axially 
symmetric solution of the vacuum Einstein equations is accompanied by an 
infinite family of potentials, which in turn allowed for an infinite parame- 
ter set of infinitesimal transformations acting on the initial solution. What 
remained unclear in Geroch’s work was the precise Lie algebra structure 
of these infinitesimal transformations and even more so the structure of a 
corresponding group (the ‘Geroch’ group) of finite transformations. Later 
the problem of getting finite transformations found some seemingly different 
solutions (‘Solution Generating Methods’) - the HKX-transformations [15], 
the Backlund- Transformations of Harrison [16], Kramer and Neugebauer [17] 
and the Riemann-Hilbert Method of Ernst and Hauser [18]. While the first 
authors were mainly concerned with the actual construction of hitherto un- 
known solutions (‘Multi-Kerr’), Ernst and Hauser made the first serious effort 
to give the notion of the ‘Geroch’ group a more precise mathematical foun- 
dation. A deeper understanding of these somewhat mysterious constructions 
and their interrelations is provided by the observation - made already before 
the advent of the ‘Solution Generating Methods’ ~ that the corresponding 
2-dimensional reduced theory is completely integrable. This property is ex- 
pressed by the existence of a ‘Lax Pair’ [19,20], a system of linear differential 
equations with a ‘spectral parameter’, whose compatibility is equivalent to 
the non-linear field equations. It is important to observe that this fact is 
essentially based on the non-linear cr-model structure of the reduced field 
equations and thus is also valid for the 2-d reductions of the large class 
of 4-dimensional models considered in [12]. In fact, for the analysis of the 
group-theoretical significance of various steps in the implementation of the 
infinite dimensional Geroch group this was a valuable guiding principle [10]. 
In [21] it was shown how the group theoretical structure of the symmetric- 
space non-linear cr-models provides the clue to the analysis of the Geroch 
group, resp. its implementation. It is possible to construct a natural exten- 
sion of the triple (G,H,t) defining the symmetric space 

G/H (where r is the involutive automorphism leaving H invariant [11]). Here 
and are infinite dimensional groups of holomorphic functions (of 

the spectral parameter of the Lax pair) with values in the complexification 
of G. The elements of the coset space G ^°°'^ / are solutions V{t,x) of 
the system of linear differential equations mentioned before. Implementing 
the group on the coset space G^°°'^ / turns out to be equivalent to 
the solution of a factorization problem for group-valued analytic functions, 
a so-called Riemann-Hilbert problem. While the Backlund transformations 
correspond to meromorphic group elements, the HKX transformations may 
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be understood as exponentials of certain iril-potent elements of the underlying 
affiire Lie algebra. 

Whereas for most of the algebraic structures comrected with this complete 
iirtegrability the signature of the two Killing vectors providiirg the reduction 
to the 2-dimensional theory is irrelevant, the analytical properties of the so- 
lutions are rather different and thus the results of the stationary, axially sym- 
metric case cannot be directly transfered to e.g. cylindrical or plane waves. 
Nevertheless some attempts with interesting results have also been made for 
these cases [22]. Another interesting development - in particular in view of 
a possible quantization - is the inclusion of fermions for the supersymmetric 
models [23]. 

Although it may be very satisfactory for esthetical reasons to see all these 
beautiful ‘hidden’ duality symmetries emerge, there is also some practical 
‘spin-ofP. The duality groups are, in a sense, large enough to act transitively 
on certain families of solutions. This not only allows to actually construct all 
solutions of such a family from the simplest member (usually some solution 
of the vacuum theory like Minkowski or Schwarzschild), but also allows to 
prove uniqueness theorems for these families. This has to do with another 
aspect besides the group structure - the solutions of the 3-d resp. 2-d field 
equations are harmonic maps [24,25]. Such maps enjoy rather strong regu- 
larity and uniqueness properties, which can be used to generalize the well 
known uniqueness theorems [26] for static resp. stationary, axially symetric 
black hole solutions of the vacuum theory resp. Einstein-Maxwell theory to 
the large class of theories considered here. The corresponding proofs are re- 
markably simple and elegant. As to be expected for harmonic maps these 
results are strongest when the 3-space and the target space have definite 
curvature, happening for stationary, axially symmetric solutions [27]. 

The organization of this article is as follows: 

• Section 2 is devoted to a short discussion of the duality symmetry of the 
flat space Maxwell equations. 

• In Sect. 3 the emergence of hidden duality symmetries in the process of 
Kaluza-Klein reduction of higher dimensional (super) gravity theories is 
demonstrated. 

• In Sect. 4 we discuss the complete integrability of the theories obtained 
by reduction to 2 dimensions, in particular the implementation of the 
infinite dimensional Geroch group. 

• In Sect. 5 the cr-model structure obtained in the previous section is ap- 
plied to construct spherically symmetric solutions and to show existence 
and uniqueness properties of static resp. stationary, axially symmetric 
black hole solutions. 

• In the Appendix A we collect some important group theoretical properties 
of symmetric space non-linear tr-models and Appendix B contains the 
results of a recent structural analysis [28] of the models considered in 
[ 12 ]. 
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2 Electromagnetic Duality 

The best known example of duality transformations are those of the Maxwell 
field (see [29] for a recent exposition and further references). In vacuum the 
Maxwell equations are 

= 0 = 0 with *F>^'^ = , (1) 

or, when written in terms of the electric field E and the magnetic field B, 
d-E = {), dhE + B = Q, 

d-B = Q, ^^B-E = Q. (2) 

These equations are not only invariant under the obvious discrete exchange 
E *F ,*F ^ —F resp. E ^ B , B ^ —E, but even under the continuous 
rotations (conveniently written in complex U{1) form) 

E + i*E ^ F%E + i*F) resp. E + iB ^ F\E + iB) . (3) 

In contrast to the energy density ^\E + iB\^ and the momentum density 
E/\B = ■^{E—iB)A{E+iB), which are invariant under the duality rotations, 
the Lagrangean density E^ — B^ and the toplogical density E ■ B - real and 
imaginary part of {E + iB)^ ~ are not, but transform into each other. Yet, 
since E ■ B = the variation of the action under infinitesimal 

duality transformations is a surface term vanishing under suitable boundary 
conditions. There exists also a conserved Noether current corresponding to 
this variation. But since the second variation yields again the action density, 
the action is not invariant under finite such transformations. Thus, although 
the equations of motion are invariant under duality rotations, the action is 
not, and hence they are what is called an ‘on-shell’ symmetry in contrast to 
a true ‘off-shell’ symmetry, leaving the action invariant. 

There is another aspect of asymmetry, connected with the introduction of 
a vector potential A^. The usual choice, valid also in the presence of electrical 
charge, is to interprete the equation d -*F = Q as the integrability condition 
(Bianchi identity) for the existence of a vector potential E^^ = — d^A^. 

In vacuum, however, nothing prevents us to introduce a similar potential A^ 
for the dual of the field strength, *E^^ = 9^71^ — d^Af^, interpreting d- E = 0 
as Bianchi identity. Combining A and A to a 2-component potential A we 
may write Eq. (1) as [30] 

= 0 and F^^'^ = [2*F^^'^ with (4) 

The action can be expressed either through A or, equivalently, through A, 
but not in terms of A. 
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The situation changes, if we consider stationary solutions, i.e. solutions 
with a time-independent field strength. Then the Maxwell equations (2) tell 
us that (at least locally) there are scalar potentials (j) and ip for th® electric 
resp. magnetic field. Combining them into a complex potential = (j) + iip 
such that E + iB = d<P the duality transformations can be implemented 
through <P — >■ Using a gauge in which the vector potential is time- 
independent, the electric potential cj) can be identified with Aq (and similarly 
the magnetic potential r/) with tIq). The remaining Maxwell equations d- E = 
d ■ B = 0 combine into A<1> = 0, which can be derived from the 3-dimensional 
(‘dimensionally reduced’) action 5'red = | / explicitly invariant 

under the duality rotations. Thus the on-shell symmetry has become an off- 
shell one in three dimensions, provided we use the magnetic potential ip, i.e. 
exchange a Bianchi identity with a field equation. In fact, the reduced action 
is even invariant under the full 2-dimensional Euclidean group ISO(2), since 
we can shift the potential <Phy a constant. Considering the potentials 4> and ip 
as elements of the coset space I SO{2) /SO{2) we may interprete the action for 
<1> as describing a non-linear cr-model. Although this may appear somewhat 
fancy at this point, this way of describing the reduced theory nicely fits into 
the structure found in the more general cases to be considered below. 

One may wonder, how the 4-dimensional Lagrangean |(E^ — B^) has 
turned into the 3-dimensional one | = ^{E^ + B^). One way to obtain 

this is to add the Bianchi identity 9 • i? = 0 for Fij with a Lagrangean 
multiplier ip to the dimensionally reduced action 

J d^x (i(E2-B2)+V>9-s) = J + (5) 

and to independently vary with respect to B and ip. Variation with respect 
to B then just yields the algebraic equation B = dip and thus the last term 
can be omitted, if this identity is used as a definition for B. 

The duality symmetry is broken, as soon as an electric current is added 
as a source for the Maxwell field. In order to maintain the symmetry a cor- 
responding magnetic current has to be added as a source for *F. Yet, the 
latter violates the integrability condition for the existence of A^, as does 
the electrical current for A^. On the other hand we need the potentials as 
dynamical variables for the local variational principle. However, the inte- 
grability conditions are only violated, where the currents are non- vanishing. 
Hence, supposing that we are dealing only with point-like electric and mag- 
netic charges we can still maintain duality, if we cut out the positions of the 
charges from space. Although this destroys the simply-connectedness of the 
space, vector potentials nevertheless exist, if we allow for Dirac string singu- 
larities (resp. connections on non-trivial U{1) bundles [31]). Denoting electric 
resp. magnetic charges by q resp. p, we have to transform q + ip ^ E'^{q + ip) 
in order to maintain the duality invariance of the eqs.(l) in the presence of 
the corresponding point sources. Classically q and p can take any value, but 
quantum mechanically electric and magnetic charges of two particles have to 
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obey the Dirac quantization condition 

qiP 2 = 2'Knh n € Z , (6) 

again violating duality invariance. This can, however, be remedied introduc- 
ing dyons carrying both types of charge and obeying the Dirac-Schwinger- 
Zwanziger quantization condition [32,33] 

qiP 2 - q 2 Pi = ‘2 ttH n € Z , (7) 

defining a 2-dimensional lattice in the g,p-plane. There is one more important 
aspect of the above quantization condition related to the fact that the charges 
in a gauge theory act not only as sources of the gauge field, but also as 
coupling constants of the minimal coupling of the point particles to gauge 
fields, yielding the forces acting on the charges in the presence of the gauge 
field. The relation (6) shows that weak coupling of electric charges (i.e. small 
values of q in natural units) implies strong coupling in the magnetic sector 
and vice versa. It is this aspect of duality that carries over to the modern 
developments in string theory. 

With the rise of non-abelian gauge groups in particle physics it was natural 
to attempt an extension of the abelian duality transformations to Yang-Mills 
theories. However, as was shown by Deser and Teitelboim [34], there is no 
such extension acting on the potentials or the field strength. Nevertheless, as 
was argued by Montonen and Olive [35], there might emerge a new form of 
duality in the quantized theory relating the original gauge theory to its soliton 
sector. Since in quantum field theory particles are represented by local fields 
there is no distinction at this point between quantum particles corresponding 
to solitons and those corresponding to the original gauge and Higgs fields. 
The proposal of Montonen and Olive was to find a duality that exchanges the 
two types of particles resp. fields. A specially favourable candidate for such a 
duality is the N=4 extended supersymmetric gauge theory. More recently the 
proposal of Montonen and Olive has been extended and made more concrete 
by Seiberg and Witten [36]. 

3 Duality in Kaluza— Klein Theories 

Kaluza-Klein theories in four space-time dimensions are obtained from Ein- 
stein’s theory in D dimensions assuming the existence of an isometry group 
generated by D — 4 Killing vector (KV) fields. Expressing the D-dimensional 
theory in terms of 4-dimensional fields one obtains by this ‘dimensional re- 
duction’ an effective four dimensional theory with gravity, vector fields and 
scalar fields. Whereas for non-abelian isometry groups this is in general an 
intricate procedure (except if the group acts freely, i.e. the orbits are home- 
omorphic to the group itself), it is more or less straightforward for abelian 
groups generated by commuting KV fields, which we will consider from now 
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For stationary resp. stationary, axially symmetric solutions of this four 
dimensional theory further dimensional reductions yield three resp. two di- 
mensional theories. In the following sections we will study the mechanism 
of dimensional reduction first generally going from D to d dimensions and 
subsequently for the special cases d=4, 3 and 2. 



3.1 Dimensional Reduction from D to d Dimensions 



We shall first investigate the general dimensional reduction of the gravita- 
tional field in a Z3-dimensional space invariant under a {D — d)-dimensional 
abelian isometry group. Under dimensional reduction we understand a pa- 
rametrization of the Z3-dimensional gravitational field invariant under the 
isometry group in terms of suitable fields defined on the space of orbits Sd of 
the isometry group. In order for this orbit space to have a reasonable mani- 
fold structure the action of the isometry group has to fulfil certain regularity 
conditions [37]. However, since the decomposition of the gravitational field 
is based on the Killing vector fields describing the action of the Lie algebra 
of the isometry group such global conditions play no role in the following 
discussion. They become however relevant, when one wants to reconstruct 
the D-dimensional manifold. Although the process of reduction could clearly 
be formulated in a completely geometrical, coordinate independent language 
it turns out to be convenient to use an adapted coordinate system 

a;^ = (a;™,x™) with x™ € and x™ £ (8) 



such that the Killing vector fields are AT™ = and hence the fields consid- 
ered are x-independent. The coordinates x™ parametrize the d-dimensional 
orbit space Sd — the space-time manifold of the reduced theory. The grav- 
itational field in the D-dimensional space is described by the metric ten- 
sor gxL or alternatively by the D-bein fields related to the metric by 
gxL = {E'^)k^VabE^ L, where ^ab = (H — ••■— ) is the D-dimensional 
Minkowski metric. The invariance under local Lorentz transformations acting 
on E^k from the left can be used to bring the D-bein into the form 



E^k = 



/ Ae“fc 0 \ 

[e^rnA^k e^kj 



( 9 ) 



adapted to the decomposition into (D — d) and d-dimensional components. 
The field is a (D — d)-bein for the isometry group, whereas e°‘k is a d- 
bein for the orbit space, A is a suitable conformal factor chosen later in order 
to simplify the Lagrangean. The fields A^k are a column of D — d vector 
potentials. The vanishing of the field strength F'^ki = dkA'^i — diA'^k is the 
condition for the hypersurface orthogonality of the Killing vector fields A''". 
From e°‘fh we can build the metric m on the isometry group as usual m = 
—(Afje with fj = (—...—) in case all the Killing vectors are space-like resp. 
fj = {-\ — ...— )if one of them is time-like. The D-bein field E^k transforms 
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covariantly under diffeomorphisms and local Lorentz transformations, but 
the special triangular form eq.(9) is preserved only by a subgroup consisting 
of 



• diffeomorphisms and local Lorentz transformations in d dimensions, 

• local Lorentz transformations in D — d dimensions depending only on x 
and not on x, 

• global GL{D — d) transformations acting linearly on the coordinates x, 

• diffeomorphisms of the special form x ^ x + f{x) acting as gauge trans- 
formations A^k — t A^k + dkf^{x) on the vector potentials. 

The Lagrangean for the Z?-dimensional gravitational theory can be expressed 
by the d-dimensional fields of the parametrization eq.(9) 



^(D,d) ^ --ER 



= epX 



d-2 






-4p ^dkpdip-8{d-l)X ^dkXp ^dip 
-4{d-l){d-2)X-^dkXdiX] 



^Tr(m ^dkmm ^dim) 



( 10 ) 



where hki = rjabF^k&^i and p = det(e). R resp. R are the scalar curvature of 
E^k resp. e“fc- 

For d > 3 we introduce p = X In addition we can choose X = p 
and eliminate the scalar factor in front of R and hence obtain the conventional 
form of the gravitational Lagrangean in d dimensions 



C(D,d) 



= e 



“5-^- + ^(Tr(/r ^dkPP ^dip) 



( 11 ) 



The Lagrangean describes the interaction of a d-dimensional gravita- 

tional field e“fc with D — d abelian vector fields (A^)j^ and a symmetric matrix 
p of scalar fields, which can be considered as taking values in the coset spaces 
GL{D — d)/SO{D — d) resp. GL{D — d)/SO(l, D — d — 1) depending on the 
signature of the metric fj. The expressions ciTi{p~^dp)'^ + C 2 {Tr{p~^dp))'^ 
are invariant metrics for these spaces. The action for p describes a so-called 
non-linear cr-model. Non-linear cr-models are field theories over a space-time 
manifold E with coordinates x“ and metric gapix) and fields assuming values 
in a target space <P with coordinates (jf and metric 7 ^ . The action for such 
a non-linear cr-model 



Ss = 



s 



y/gdxg°'^ {x)da(j)\x)di3(j}\x)"/ij{(j){x)) 



( 12 ) 
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leads to the field equations 

= 0 , (13) 

where 1)*^. are the Christoffel symbols of the metric 7^ . Solutions of eq.(13) 
are known as harmonic maps in the mathematical literature [24] . Many of the 
well-known (existence and uniqueness) results on harmonic functions could 
be generalized to this class of maps. Some interesting applications in gravity 
can be found in [25]. The target spaces considered here are very special - 
they are riemannian resp. pseudo-riemannian symmetric spaces G/H [11]. 
In Appendix A we have collected some important group theoretical aspects 
of these spaces. Using coset representatives M with values in the group G 
(compare Appendix A for details and notation) we can rewrite eqs.(12,13) in 
the form 

(14) 

i; 

= 0 . (15) 

where Ja = \M~^daM ■ 

The vector fields (A™) transform with a vector representation of GL{D — 
d) and are coupled non-trivially to the scalars, which play the role of space- 
time dependent dielectric constants. In order to get a positive energy density 
it is essential to have fj = (— . . .— ), which we have already anticipated in 
taking the additional dimensions greater than four all space-like. 

3.2 Reduction to d = 4 Dimensions 

The special case D = 5, d = 4 is the original Kaluza theory [38] with the 
Maxwell field and the scalar field m = . 

There is an important aspect of the case d = A {D arbitrary) connected 
with generalized duality transformations of the Maxwell fields A^^- The 
Lagrangean eq.(ll) contains the vector fields A™^ only through their field 
strengths since the scalar fields are neutral with respect to the gauge 

group acting on the A’s. Correspondingly the field equation for the vector 
fields is simply 

= 0 (16) 

without any currents on the r.h.s.. In addition the field strengths obey the 
Bianchi-identities 

*F'^''.^ = 0 with . (17) 

Putting pFn\ = *Fk,\ we can interprete eq.(16) as the Bianchi-identity for 
FkA, which accordingly are the field strengths of potentials (Am)^ obeying 
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the field equation 

= 0 . (18) 

The potentials AfhK provide an alternative description of the vector fields. 
They transform, however, contragrediently under GL{D — 4) similarly as ^ 
and 





gAn 


(19a) 


A« - 


{g^r^An 


(19b) 


M - 


(5^)-Vff-' 


(19c) 


■ 


gg~^g^ ■ 


(19d) 



Like in the case of source free electrodynamics in fiat space discussed in Sect. 
2 we can render the symmetry of the field equations (16) and (18) under the 
discrete duality transformation explicit putting them together 

to form the 2{D — 4) components of a vector potential 

with the field strength — d^A^^ obeying the twisted self-duality 

constraint 

= with l? = yM= (21) 

satisfying again 17^ = —1. Due to (21) the field equation 

(1?F"^);« = 0 (22) 

is obviously equivalent to the Bianchi identity 

(*F«^);, = 0. (23) 

Again we may look for a Lagrangean describing the interaction of the A^’s. 
As described in Chap. 2 this can be achieved considering the field strength 
Fk[/ as independent dynamical variable besides the potential A„ and adding 
the Bianchi identities with a Lagrange multiplier A^ 

JC- = ^C' = + ^-Al*F^\,^ (24) 

to obtain 

C" = -^-Fl^^^-^F'^^ . (25) 

As before there is no way to express the Lagrangean symmetrically in and 
An and hence the discrete duality transformation A^ — >■ A„ is an ‘on-shell’ 
symmetry valid for the equations of motion, but not for the action. 
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The fact that a vector field gives rise to another vector field via dual- 
ization is clearly special to 4 dimensions. In general a p-form will lead to a 
(d — p — 2)-form by dualization. Certain supergravity models [4] for instance 
lead to gauge potentials which are 2 -forms yielding by dualization a scalar 
‘axion’ field in 4 dimensions. In fact, it turns out to be possible to generalize 
the structure of the four-dimensional theory we have obtained through the 
Kaluza-Klein reduction in a way that allows to include all the known super- 
gravity models with abelian gauge fields [12]. Besides the gravitational field 
these theories contain scalar fields of a non-linear cr-model with a target 
space ^ which is assumed to be a non-compact Riemannian symmetric space 
G/H. In addition there is a fc-dimensional column = (A^^) {I = 1, ... ,k) 
of real vector fields with field strengths Fap = daAp — dpAa and their duals 
*pa /3 _ _^^a/SjS ^**p _ _ p-j satisfying Bianchi identities = 0. 

The most general gauge invariant action quadratic in the field strengths is 

Sv = J y/gdx[-^Flp{p{^)F‘^f^ - v{$)*F^!^)) (26) 

i: 

where p = {pij) and v = {vu) are real symmetric matrices depending on 
the fields This action yields field equations 

= 0 (27) 



Introducing in the by now familiar way dual potentials A^ = {A^j) we have 
obtained 2 k vectors 

and field strengths = daAp — dpA^ satisfying the linear relation 

F^p = YM*F^p = n*Fo.p (29) 



with 



F = 



0 77"^ 

—77 0 



M = 



ly p ^ u 

k,T\-1 ,,-i , 



V p ^ r] ^ 



{rA) ^ pL ^ ly ( 77 "^) ^ 77 ^ 77 ^ 



(30) 



where 77 is some real k x k matrix and the symmetric matrix M satisfies 
YMY = —M~^ and hence = —1. The case of the coupled Einstein- 
Maxwell equations corresponds to putting p = 1 , v = Q and G = H = SO{ 2 ). 
The equations 

= -{YMF^^),^ = 0 (31) 



can be interpreted as field equations and Bianchi identities for either Fap or 
*Fap. These equations are G-invariant under the transformation 



G 5 g: JAap P{g)Pa 0 



(32) 
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provided the real representation p satisfies = Y~^ p{g) Y and we can 

construct a matrix M{^) of the form (30) depending on the scalars in such 
a way that 

GBg: M(^) ^ p{rT M{^) P{r^) ■ (33) 

The contribution from Sy to the stress tensor, which takes a particularly 
simple form using the ‘doubled’ field strength T) 

= -Fl^ p + \g^p T’T ^ 

= - \Fl^ MFp^ = - \Fl^ Y (34) 

as well as the contribution to the scalar field equations 






- ^ fnF'^ ( *pap\ _ 1 r-pT 



5v 

5(j) 



SM _ 
50® 



(35) 



will then be explicitly G-invariant. Since Tqq^ must be positive for a physically 
meaningful theory the matrix M (and hence p) must be positive definite. 

Collecting all the terms in the Lagrangean and the field equations describ- 
ing the coupling of the gravitational field gap, the vector field strenghts and 
their duals comprised in F_^p and the scalars 0® parametrizing M we obtain 

+ \g^^{M-^daM,M-^dpM) 

-\fIp (/^(^)i"“^ - vWF ^^) ) , (36) 



Rcp = -\eL^MF^'^ + -^{M-^daM, M-^dpM) , (37) 

{MF°‘^).a = 0 , (38) 

= (39) 

where (■ • .)pr denotes a projection on the Lie algebra of G (see Appendix A). 



3.3 Reduction to d = 3 Dimensions 

In this section we shall see how field configurations of the 4-dimensional 
theories discussed above allowing one Killing vector field give rise to a ‘di- 
mensionally reduced’ 3-dimensional field theory. Since this reduction can be 
performed irrespective of the signature of the KV field, it can be applied to 
time-translations or a space-like isometry like axial rotations, but for reasons 
of simplicity we shall use the notations for time-translations. In case we want 
to explicitly refer to a space-like KV we shall indicate this with a prime on 
the corresponding field. 
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Since our 4-dimensional models contain gauge fields and scalars besides 
the gravitational field, we have to require that the Lie derivative of the field 
strengths and of the scalar fields vanish. Then we can and will use a gauge in 
which also the Lie derivative of the gauge potentials vanishes. As before 
we choose adapted coordinates such that the isometry is just a translation 
(e.g. K = ^). All the fields and will then depend only on the 

remaining three coordinates a;™ (m = 1,2,3) parametrizing the orbit space 
S3 of the action of K. In these coordinates Ka has the form Ka = A) 

with Z\ = and the metric ga /3 can be decomposed as (compare eq.(9) for 
the 4-bein) 

f 'l\hjnn S Akmkn Ak', 

Ak^ A 

We will use the rescaled metric h^n on the three dimensional orbit space 
S3, since it leads to the standard form of the Lagrangean in 3 dimensions. 
In order to do this we have to require A ^ 0. hmn is positive definite if K 

is time-like, i.e. Z\ > 0 and has the signature (-1 ) if K is space-like, i.e. 

Z\ < 0. Similarly we decompose the vector fields = {Al^ + kmA^ , A^) into 
pieces A^^ and A^ perpendicular and parallel to K. We can now rewrite the 
Lagrangean (36) in the form (apart from surface terms) 

£ = - ^dmA^g{^)dnA 

+ 2/\2 ^rnAd„A'j -| —kmnk™'^ (41) 

+{Fmn + fc„„A)T(-|^(^)(F™" + fc™"A) + 

where is the scalar curvature for hmm = dmkn — dnkm, Fab = 
dm An — dnAm and all indices are raised or lowered with the metric h. The 
Killing vector field is hypersurface orthogonal iff kmn = 0. 

If the original field configuration was a solution of the 4-dimensional field 
equations eqs.(37) then hmn, A, km, A^, A^ and are a solution of the 
three dimensional field equations derived from the action L cPx and vice 

versa. The field equations for the 3-dimensional vector fields A^ and km 

-Agi^XF^'^ + k^^A) + ^e^^Piy{^)dpA) =0 (42) 

-k A^ (-Ag{^){F"^^ + k^^A) + ^e"^^Pv{$)dpA'j = 0 (43) 

can be considered as Bianchi identities for dual potentials Aj (which are just 
the time components K°^Aai of the dual potentials Aq,/) and ip (the twist 
potential) . 

pmn p ^ _ pj-^dpA) ( 44 ) 

y/h ^ 
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imn _ 1 ,mnp ^ , 



Vh 



with 



dmi^ = U]m-{A (77 ) dmA - A Tj d^A) . 



(45) 

(46) 



Putting A = {A, A) we can rewrite the twist vector in the explicitly G- 
invariant form 

+ A^Y~^dmA . (47) 

The Lagrangean of the dimensionally reduced three dimensional theory be- 
comes 

£(3) = + 

+ {Jm, Jn) - idraA^MdnA^ ) (48) 

= (</.)) 

where is a metric on the ‘potential space’ <P parametrized by <(> = 

(Z\, tp, A-: 4')- Thus we have obtained a non-linear cr-model with a target 
space coupled to (three dimensional) gravity. 

For a space- like Killing vector (Z\ < 0) the metric on ^ is positive definite, 
but for a time like-Killing vector {A > 0, stationary solutions) the metric is 
indefinite with 2k negative terms due to the fields A originating from the k 
vector fields in the 4-dimensional theory. 

In terms of the various components the field equations are 



= dm(!)"dncAlij{4>) , 




(49a) 


(^) =0. 
V / ;m 




(49b) 


(^d^A+Y-^A^) =0, 




(49c) 


+ =0, 
\ / ;m 




(49d) 




1 ) =0. 
pr/ ;m 


(49e) 


If all potentials A for the vector fields vanish then ujm = and {A,tj}) 

parametrize the well known SL(2)/SO(2) cr-model of pure gravity [7]. Before 
we proceed to the general case let us shortly recall the results about this 
simplest case. In order to recognize the metric ds^ = A~^{dA^ + dijA) as the 
invariant metric of SL(2)/SO(2), it is necessary to find a suitable parametri- 
sation of the coset space in terms of the coordinates A and ip. This can be 



done parametrizing suitably selected coset-representatives in the group itself 
or using some matrix representation. In the former case we may choose a 
basis for the Lie algebra of SL{2) and use the exponential map. A canonical 
choice for the Lie algebra of SL{2) consists of the three elements (d, e, k) with 
the commutation relations 
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The choice 



P = 



(51) 



may be termed ‘triangular’ in view of a corresponding representation by tri- 
angular matrices 



P = 



fVA 0 \ 

i ^ (52) 

\Va VaJ 

It is easily checked that the Killing metric of SL{2) restricted to the 
coset space yields the desired expression in terms of A and ip. As explained 
in Appendix A the group SL(2) acts in a non-linear way on the triangular 
representatives. It is easy to check that the Lie algebra is realized on A andip 
through the infinitesimal transformations 



d : 


5A 


= -A 


Sip = —Ip 


(53) 


k : 


5A 


= 0 , 5ip 


= -1 


(54) 


e : 


5A 


= 2ipA, 


Sip = ip^ -A^ . 


(55) 



Obviously the elements k resp. d correspond to a trivial shift of ip resp. 
a simple rescaling of the KV, whereas the action of e really changes the 
solution. The latter is usually referred to as ‘Ehlers’ transformation. 

Another convenient parametrization is in terms of the complex ‘Ernst’ 
potential £ = ip + iA undergoing a Mobius transformation under the group 
action of SL{2). It is easy to check that the whole group is generated by the 
shift £ ^ £+ciw combination with the discrete inversion £ ^ \ j£ discovered 
by Ehlers [6]. 

Turning back to the general case the metric on the space ^ and the field 
equations (49a-49e) are obviously invariant under 



• G-transformations (acting on cp and A), 

• ‘electromagnetic’ gauge transformations (with a constant vector a) 



'tp — y Y , A. — y j 4 “h Q, , 


(56) 


• twist gauge transformations 


Ip ^ ip + b , 


(57) 


• scale transformations 






A -)> cM , 


ip — >■ c^ip , A — >• c A . 


(58) 



It turns out that for many interesting theories (e.g. Einstein-Maxwell, various 
supergravities) it is possible to extend the Ehlers transformation to an invari- 
ance of the whole target space Commuting this generalized Ehlers trans- 
formation with (infinitesimal) electromagnetic gauge transformations yields 
generalized Harrison transformations [9]. The structure of the Lie algebras 
corrsponding to this extensions given in Appendix B. Their action on the 
scalar potentials is to lowest order (consult [28] for the complete transforma- 
tions) 
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• (infinitesimal) generalized Ehlers transformations 

5 A = 2^ A 

= - A"^ + ^A^MAA + OiA^) 

H = 0{A^) 

SA={'^~ AYM)A + 0{A^) , 

• (infinitesimal) generalized Harrison transformations 

SA = -cA^YaA 

S^_ = IA{AM - ^Y)a + 0{A^) 

5<j) = 0{A) 

SA={i}~ AYM)a + 0{A^) . 



(59) 



(60) 



The Lie algebra of infinitesimal transformations gives rise to a noncompact 
Lie group G with maximal compact subgroup H' and depending on the sign 
of A the target space is either the Riemannian symmetric space (p' = G/H' or 
the pseudo Riemannian symmetric space <P = G/H, where iJ is a noncompact 
real form of H' . The dimensions of G and H are 



dimC? = dimG+dim5'L(2)+4A: , dimH = dimH+dimSO{2)+2k . (61) 

The submanifold where H = 0 (i.e. G/HflG / H') is SL{2)/SO{2) x G/H and 
the generators of the coset space transform under the two dimensional rep- 
resentation of SL{2) in order to reproduce the scale transformations eq.(58). 
In Appendix A we give a complete list of all combinations of groups G, H, G 
and H corresponding to a time-like KV. For each of them we can start from 
the Lagrangean eq.(48) for the three dimensional theory and reconstruct the 
Lagrangean eq.(36) of the corresponding 4-dimensional theory. 

In order to identify the potentials with suitable coordinates for the coset- 
space abstract ‘triangular’ generators may be exponentiated yielding [28] 

p ^ ^ ^Q2) 

Or we proceed in a similar way as we did previously for G (compare Appendix 
A) and choose a (possibly complex) irreducible matrix representation p of G 
and a hermitian matrix X satisfying p{r{g)) = X~^ p{g~^) A and define P = 
p{P), M = X P = X p{M). Introducing the currents Jm = 
the Lagrangean eq.(48) can be rewritten in the form 

= M-^dnM)\ (63) 

V 2 8 / 

with a constant c depending on the representation p and on the particular 
(T-model. 
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The field equations eqs.(49a-49e) become 

R± = {Jm,Jn) (64) 

= 0 . (65) 

but not all of the conserved currents Jm are independent, since r(J) = 
An alternative way writing the field equations related to the 
triangular representers P is (compare Appendix A) 

D‘^Ja = 0 with Daj0 = J/3-a - [AonJfl] (66) 



In Sect. 5 we will study solutions with asymptotically Euclidean and 
(T-model fields which have an asymptotic multipole expansion 



M(^)^^r-"M„(i?,<p) . (67) 

n—0 



Without loss of generality we can assume that we have the asymptotic values 
Aq = 1, '0Q = Ag = 0, Mg = 1 corresponding to Pg = 1, Mg = 1 or Mg = X. 
From the asymptotic expansion of M we can read off the ‘gauge’ charges 



Q 



1 

47T 





( 68 ) 



taking values in the Lie algebra of G. In fact those in the Lie algebra of H 
vanish, due to the relation r(M) = M~^. The subgroup H C G leaves the 
asymptotic value Mg invariant and transforms the charges according to 



H^h: Ml ^hMih~^-. 



(69) 



The precise form of this action of H on the charges depends on the particular 
(T-model under consideration. There are, however, in all cases, generators of 
H which differ from the transformations eqs. (59-60) by suitable multiples of 
the transformations eqs. (56-57). These we can use to transform the (electric 
and magnetic) vector charges and the NUT charge into the mass and the 
scalar charges and vice versa. 



3.4 Reduction to d = 2 Dimensions 

Stationary, axially symmetric configurations of the 4-dimensional theory are 
characterized by their invariance under two commuting Killing vector fields 
K = ^ resp. at' = ^ describing time-translations resp. axial rotations. 
Similar to the case of one KV we can perform a ‘dimensional reduction’ 
from 4 to 2 dimensions. Since we want to make use of the (T-model structure 
obtained for the 3-dimensional theory after suitable dualizations we prefer 
to employ a two step procedure. First we use one KV field to reduce from 



Duality and Hidden Symmetries 291 



4 to 3 dimensions and then the second one to do the step to 2 dimensions. 
Depending on whether we take the KV K or K' to perform the reduction 
from 4 to 3 dimensions we end up with two different tr-models corresponding 
to G/H resp. G/H', parametrized by the matrices M resp. M' . In the first 
case we obtain a pseudo- Riemannian coset space in the second a Riemannian 
one, whereas the the opposite holds for the remaining 2-d orbit spaces S 2 - 
In order to be able to do the step from 3 to 2 dimensions we first have 
to make sure that the Lie derivative of the 3-dimensional fields hmn and M 
with respect to the second KV vanishes. For hmn this is a direct consequence 
of the commutativity of the two KVs. For the electromagnetic potentials A 
we find that their Lie derivative has to be constant. This constant has to 
vanish for asymptotically trivial configurations with a regular rotation axis. 
In adapted coordinates the vanishing Lie derivative of hmn and M means they 
are independent of t and Lp. Hence the only non-trivial step in the reduction 
from 3 to 2 dimensions concerns the decomposition of the 3-metric hat which 
we parametrize in analogy to eq.(40) in the form 



h 



77171 



f X'^hki + p^hkbi p^bi \ 

I P^bk ) 



(70) 



In terms of these new fields the Lagrangian (63) becomes 



£(2) = ^ 



-]^R--^bMb^^ + ^{M-^dM,M-^dM)-X-^d\p-^dp . (71) 



A novel feature for d = 2 is that A cannot be used to remove the factor 
p multiplying \fh in front of the Lagrangian. This is a consequence of the 
conformal invariance of the 2-dimensional theory. 

A simplification occurs with the vector fields bk which loose their dynam- 
ical degrees of freedom in 2 dimensions. From the field equation 



{p^X-^b%i = 0 



(72) 



it follows that the dual field strength *b = -Aj=e^^p^X "^bki obeys the equation 

2y/h 



e^^di*b=0 



(73) 



and hence *b is constant that vanishes for asymptotically Minkowskian so- 
lutions and consequently bki = 0, i.e. the orbits of the KVs are orthogonal 
to 2-surfaces A 2 . This establishes a ‘Generalized Papapetrou Theorem’ [39]. 
Assuming the vanishing of b in the following C simplifies to 



£(2) ^ 









X-^dXp-^dp 



The field equations derived from £^2) ^re 

Rm - ^hkiR = +^{M~^dkM,M~^diM) - 2A"^i9(fcAp"^5qp 



( 74 ) 
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- ]^hki(^{M-^dM,M-^dM) -2\-^dXp-^dp^ (75a) 

{pM~^d’^M),k = 0 , (75b) 

{d^p)-k = 0 . (75c) 

We have omitted the field equation for A, because it is a consequence of 
eq.(75a) as we will be argued below. 

The 2-dimensional metric h^i can locally be brought to the conformally 
fiat form hki = hSu by a suitable choice of coordinates. Finally h can be 
absorbed into the conformal factor A leading to hki = Ski- With this choice 
Eq. (75a) turns into a system of first order equations for A, since the left hand 
side vanishes. Since the field equations for M and p are conformally invariant 
they are independent of A and hence the same as in fiat space. In particular p 
is a harmonic function on R^. Together with its conjugate harmonic function 
z defined by dz = —*dp (note that **d = —d in a 2 dimensional space with 
definite metric) it provides a canonical coordinatization of the 2-dimensional 
reduced manifold as long as dp 0 (Weyl’s canonical coordinates). Eq.(75a) 
then becomes 

X~^d,X = ^{M-^dpM,M~^d^M) 

X~^dpX = ^ (^{M~^dpM, M~^dpM) - {M~^d,M, . (76) 

Using d± = dz idp with the property *d± = ±id± we can write these 
equations in the form 

a±lnA = T|(J±, J±). (77) 

This equation allows to get A from M (once the latter has been obtained 
solving eq.(75a)) through simple integration. 

4 Geroch Group 

In [19,20] it was recognized that the equation of motion (75c) for M{x) can be 
obtained as the integrability condition of a system of linear differential equa- 
tions involving a spectral parameter (‘Lax Pair’) in analogy to well-known 
‘Completely Integrable Systems’ like KdV or Sine-Gordon [41]. Although 
originally formulated for the Einstein vacuum theory with M G SL{2) the 
same construction works for arbitrary symmetric spaces G/H. The linear 
system used in [20] can be written as 

d±UU~^ = (78) 



with U{t,x) taking values in G (for real t) resp. some matrix representation 
of G. A second, even more useful form of this Linear Spectral Problem (LSP) 
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is obtained for V{x,t) = P{x)U{t,x) obeying [21] (compare Appendix A for 
the definition of A and J) 

d±V{x, t)V{x, t)-^ = A± + (LSP) (79) 

1 ±%t 



In order to reproduce the explicit p-dependence of the equation for P resp. 
M the ‘spectral parameter’ t has to depend on p and z. One finds that t(p, z) 
has to obey the pair of differential equations 



d±t = t 



1 =F 
1 ± it 



(80) 



implying 

d^{z+P-{\-t)^=Q (81) 

and thus 

(82) 

(where w is a constant of integration) with the solutions 

t±{w,x) = -({z-w)±^/{z- wY + pA = . (83) 

p\ H 

The pairs (w, t±) define for each given x = {z, p) with p 0 a two-sheeted 
Riemann surface Ti-x, with the branch points w = z ± ip. The replacement 
t — >■ — y exchanges the two sheets. We can choose the branch cut along the 
line segment (z = Rew, p < jlmicj). For z < Reui resp. z > Reiu the value 
of t+ lies inside resp. outside and t_ lies outside resp. inside of the unit circle. 
For p = 0 the Riemann surface degenerates and splits into two disconnected 
planes w = z and the function t{w) becomes singular. One finds in particular 



t+ 





oo for z < Re w , 
0 for z > Re 1 C 



(84) 



Since t depends on x it is important to recognize that dV is to be interpreted 
as differentiation for fixed w, i.e. 



d±P{t,x) = d±P{t,x) +d±t{w,x) 



^ dV{t, x) 
~dt 



(85) 



This implies that to any given value of w there will be two solutions V±{w, x) 
according to the choice t±{w). Under suitable conditions on P{x) (see below) 
V± will however define a single valued function P{t, x) on some domain on TZx- 
We still have to supplement the equation for P with a suitable normalization 
condition. Since for t = 0 (resp. w = oo) the r.h.s. of eq.(79) becomes d±PP~^ 
we can impose P{0,x) = P{x). More precisely we shall try to find solutions 
V{t,x) having a Taylor series expansion around t = 0 starting with P{x). 
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For real t eq.(79) is invariant under the transformation 

^ t(V{-j,x)) . ( 86 ) 

This transformation plays an important group-theoretical role. We may 
consider the r.h.s. of eq.(79) as an element of the infinite-dimensional loop 
algebra with respect to the Lie algebra of G [40] . Applied to elements of this 
Lie algebra is a natural extension of the involutive automorphism r of G. 
Assuming for the moment there is a corresponding infinite-dimensional group 
(^(oo) (the ‘Geroch group’), the function V{t,x) considered as an element 
of that group is again ‘triangular’ in the sense that it has a Taylor series 
expansion at t = 0 (i.e. only positive powers of t) and the t-independent term 
is triangular. 

A central object for completely integrable systems is the ‘Scattering’ or 
‘Monodromy’ matrix of the LSP [41]. As a kind of non-linear Fourier trans- 
form it encodes the properties of the solution of the non-linear equations 
of motion in its dependence on the spectral parameter of the LSP. Usu- 
ally this object is obtained through some limiting procedure from special 
(‘scattering’) solutions of the equations of motion with specified asymptotic 
behaviour. Although it is well known that there are problems with this def- 
inition of the Monodromy matrix for non-linear cr-models in flat space, it 
was recently shown [2] that the cc-dependence of the spectral parameter t 
allows to overcome this problem. We will however avoid this problem alto- 
gether by a purely ‘algebraic’ definition. Guided by eq.(A.214) we can use 
the automorphism to construct 

M{t,x) = T^°°\V~^{t,x))V{t,x) . (87) 

Glearly, in order for this definition to make sense, we have to assume for 
the moment that the two factors have an overlapping domain of existence as 
functions of t for given x. This is highly non-trivial, since V{t, x) is assumed to 
be analytic at t = 0 and thus x) = TVi—lji, x) is analytic at f = oo. 

Since 'P{t,x) and t{'P{—^,x)) are solutions of the same linear differential 
equation we find, as a consequence, that M{t,x) satisfies d±M{t,x) = 0 
and thus is independent of x, but may and in fact does depend on w. The 
corresponding G- valued function M.{w) is our definition of the Monodromy 
matrix of the LSP (compare [42] for a similar construction for the KdV 
equation) . 

The inverse transformation (‘Inverse Scattering Transform’) requires the 
factorization of A4 considered as a function of w{t,x) in the form (87). In 
order to give this factorization a more precise meaning, we have to determine 
the analytical properties of V{t,x). To be specific, let us assume that P{x) 
is regular outside some compact region containing possible singularities or 
sources. Regularity is here to be understood with respect to the manifold 
structure of G as a Lie group or a suitable matrix representation thereof. 
Thus let us assume that P{x) is asymptotically regular in the sense that 
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• P{x) is analytic in a simply connected domain X in the closure S 2 of S 2 
whose complement fh = 172 \ df is closed and contained in a semi-disk 

'Dr = {{z,p):p>0, r = < R} (88) 

of radius R; 

• the configuration is asymptotically flat and sufficiently regular at infinity, 

i.e. 

P{x) = l + o(^^y dP{x) = o(^^^ . (89) 

For a solution to have a physical interpretation one should add further 
requirements to those put down in the preceding conditions as e.g. positivity 
of the mass or boundary conditions at the horizon for black holes. Note how- 
ever that the regularity on the rotation axis is guaranteed by the regularity 
of P{x). 

Given some asymptotically regular solution P{x) we integrate (79) and 
obtain the corresponding functions P±{w,x). We choose the w-dependent 
constants of integration in P±{w,x) such that in the limit p — >■ 0, z — >• —00 
(i.e. — y 0) 

V+{w,x) — > 1 . (90) 

p — >-0 
z—^ — 00 

Due to the asymptotic behaviour (89) of P{x) we can integrate the differential 
equation (79) along a large circle and find 

V+{w,x) 1 (91) 

r—¥oo 

in the whole asymptotic region. In order to characterize the domain of ana- 
lyticity of P{t,x) we need some definitions. Let X he a, domain of the type 
described in the preceding definition and W the domain in the complex w 
plane given by W = {w: (Reic, | Imw|) € X}. For each point x in the (z,p) 
half plane we define the domains 7±, and 

f±={t:t = t^{w,x),w€W} , r± = c\7^, r" = r+nr". 

(92) 

where W is the complement of W. The transformation t ^ will clearly 
map onto 7(jf and will therefore leave their intersection invariant. 
Furthermore let T>± and T> be the domains 

V± = {{t,x): t€Tl,x€X} , V = V+nV_. (93) 

Starting from the asymptotic value P+{w, x) — > 1 we can, for a fixed value 

r—¥oo 

of w, determine V+{w^x) by integration of (79) along a suitable path in the 
{z, p) half plane. The r.h.s. of (79) is analytic (in x and w) ior x € X except 
for the branch cut starting at a; = (Re u>, | Im i/;|). The resulting V^{w,x) 
will therefore be an analytic function of w and x as long as we can find a 
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path of integration avoiding these singularities. This is possible for all a; € 
except those on the branch cut. In order to determine V- we have to reach 
the second sheet, i.e. w must lie in the domain W. For the moment we need, 
as before, the additional assumption that Im w 0 but it is easily seen that 
the results remain true for Imw = 0. Except for the branch cut the function 
'P-{w,x) will be analytic if (w,x) € W x ff. We can finally analyze, for 
w £ W, the behaviour near the branch point for w € W and find 

P±{w, x) = Pi(w, x) + t±(w, x)P 2 (w, x) (94) 

with Vi and P 2 analytic in a neighborhood of the branch point. 

Using the analyticity of 'P±{w, x) and the behaviour (94) near the branch 
point we deduce that V{t,x) is an analytic function in the domain and 
that V{—j,x) is analytic in Ai{t,x) will therefore be analytic in T>, 
but the domain of analyticity is in fact much larger. Since M±(w) is x- 
independent it cannot have branch points at w = z ± p, i.e. M+{w) = 
M-{w) = M.{w). M.{w) is analytic in W and M{t,x) is therefore analytic 
in the domain 

V = {{t,x): (95) 

Let us now turn to the factorization problem for Ai. For that reason it is 
important to analyze the position of the sets 7J. As long as z < —R, the 
domain lies entirely outside the unit circle and to the left of the imaginary 
axis whereas TA lies inside the unit circle and to the right of the imaginary 
axis. For these x all singularities x) in TJ should be due to V{t, x) and 

those in TA due to 'P{—j,x). If we vary x, these domains and the singularities 
of V{t, x) will move in the complex t plane and will eventually cross the unit 
circle but the two domains 7J will never intersect as long as z + ip £ W or 
equivalently x G X. As soon as z > R, lies entirely inside the unit circle 
(but still to the left of the imaginary axis) and TA lies outside the unit circle. 

We can, therefore, choose a family of contours in the t plane which 
have, for all x £ A, the following properties: 

• is invariant under the mapping t — > — ^ and will thus pass through 
the two fixed points t = of this map. 

• The domain contains a neighborhood of and the domains resp. 
TA lie in the exterior resp. interior of C^. 

• The contours depend continously on x. 

There remains a lot of ambiguity in the choice of these contours, and we 
could e.g. choose the unit circle for z < —R. What we really need is the 
existence of a contour which separates the domains • No such contour can 
exist for x G X because, in this case, both and TA will contain the points 
t = ±i and this non-existence of a suitable contour will lead to singularities 
of V{t, x). 

In view of these properties of the domains of analyticity the factorization 
problem maybe considered as a group- or matrix-valued Riemann-Hilbert 
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problem for the group- valued function A4(t,x) of t for fixed x given in a 
neighbourhood of the curve with factors analytic in domains Tj^ resp. 
with the geometrical properties described above. A simple argument shows 
[21] that a solution V{t^x) of the factorization problem automatically solves 
a LSP of type (79) with suitable currents A{x) and J{x) and thus yields a 
solution of the eq.(75a) for M{x). 

In general the solution of such a Riemann-Hilbert problem is difficult. A 
useful strategy [41,18] is to reduce it to the solution of a system of Fredholm 
equations. In the present case there is however a very helpful connection 
between A4{w) and M{z = w,0), i.e. M{x) on the axis p = 0, derived from 
the relations (84). 



f M{w,0) for w < —R , 
( M“^(w,0) for R<w. 



(96) 



We see from (96) that although A4(w) is ^-independent, it contains enough 
information to determine the behaviour of M(x), and thus P(x) on the axis 
(|z| > R, p = 0). Together with the assumed analyticity properties this is in 
principle sufficient to determine P(x) everywhere in X . As shown by Ernst 
and Hauser [18] the factorisation problem can be explicitly solved along a 
suitable piece of the p = 0 axis employing eq.(96). From this they are able to 
conclude its solvability in the whole domain X . 

Next we turn to the definition of the ‘Geroch group’ acting on the solutions 
of (75a-75c). Given a solution of the linear equation LSP (79) and a function 
g{w) with values in G resp. the complexification of G for complex w we can 
obviously get another one by the transformation 

P{t,x) ^P{t,x)g{w)~^ (97) 



where g{w) is considered as a function g(t, x) = g{w{t, cc)). The only problem 
with this transformation is that in general it destroys the ‘triangularity’ of 
P{t,x), since in view of (83) w considered as a function of t is 

w = z+'^{j-t), (98) 

which is singular at t = 0. In order to compensate for that one can try to act 
from the left with some group element h{t, x) depending on g and P removing 
the negative powers of t and bring P{x) = P(0, x) to triangular form. For the 
cc-independence of Ai it is necessary to choose h invariant under This 

is quite analogous to the standard action of G on the symmetric space G/H 
parametrized with the triangular representatives P. In [21] it was shown that 
the resulting transformations 

V{t,x) — >■ Vg{t,x) = h{t,x)V{t,x)g{w)~^ (99) 

correspond exactly to the group proposed by Geroch [14]. The latter emerges 
in the attempt to act on the potentials parametrizing the matrix M with 
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the group GL{2) mixing the two Killing vector fields K and K' linearly. 
According to Geroch this requires the introduction of an infinite set of further 
potentials - represented here with the ‘generating functions’ U{t,x). The 
action of the Geroch group on these functions was derived in [21] using the so- 
called Kramer-Neugebauer mapping [43] . Since there is no natural extension 
of this mapping to the general class of theories considered in this context 
we prefer to postulate the transformations through (99). This requires the 
determination of h(t,x), which can however be reduced to the solution of 
the factorization problem for M.{w): The point is that the transformation of 
M{w) corresponding to (99) requires only the knowledge of g{w), because 
due to = h we get 

M.g(w) = Tg{w)A4{w)g{w)~^ . (100) 

Hence we can construct Vg making a detour via Mg expressed by the diagram 

V M-^ Mg-^Vg . (101) 

Once we have succeeded to factorize Mg(w) as 

Mg(t, x)) = (j’git, x)~^^Pg(t, x) (102) 

we just define 

h = VggV~^ . (103) 

It is straightforward to show that this h is invariant under 

= T^°°^VgT^°°'>gT’^°°^V-^Vg-^Vg^ = r'^^'^Vg MgV~^ = 1 . 

(104) 

Since we are only interested in V{t,x) corresponding to asymptotically regu- 
lar solutions P{x) we have to impose the condition that g(w) is holomorphic 
in a neighbourhood of w = oo (taking values in G for real w) and g{oo) = 1 . 
In order to give this set the structure of a group we have to form equivalence 
classes of g{wys related through analytic continuation or equivalently take 
the maximal analytic continuation. 

As first demonstrated by Ernst and Hauser [18] the Geroch group acts 
transitively on the class of stationary, axially symmetric solutions of the vac- 
uum Einstein eqs. (G = SL{2)) which are regular in the neighbourhood of 
some given point on the axis. This implies in particular that all asymptoically 
regular solutions can be obtained from Minkowski space. Since the latter is 
represented by M{w) = V{t,x) = 1 this requires the factorisation of M{w) 
in the form M{w) = g{w)'^g{w) with g{w) analytic at w = oo. This can eas- 
ily be done explicitely in terms of triangular matrices in the case of SL(2), 
but in general no such explicit solution exists. In fact, for pseudo-riemannian 
symmetric spaces such a factorisation in ‘triangular’ group elements can only 
exist in a neighbourhood of the unit element. Fortunately this is sufficient in 
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our case in view of the normalisation condition g{oo) = 1 . Thus the transi- 
tivity of the Geroch group remains valid for the general class of models under 
consideration. 

It is sometimes convenient to express the new solution Vg{t,x) as 

B g{s): V(t,x) ^ Vg{t,x) = Z^{t,x)V{t,x) (105) 

and use a slightly different Riemann-Hilbert problem in order to determine 
directly Z^{t,x) [44]. Clearly Z^{t,x) is analytic in the domain (for 
a suitably chosen X) as are 'P{t,x) and Vg{t,x). We find that we have to 
construct 

G{t,x) = 'P{t,x) g~^{w{t,x))'P~^{t,x) (106) 

Z^{t,x) = Q{t,x) (107) 

and decompose Z^{t,x) in the form 

Z\t,x) = Zi{t,x)Zl{t,x) (108) 

where Z^{t,x) is analytic in T>± and Z^{0,x) = Z^{oo,x) is ‘triangular’. 

If the function Z^(t,x) in the Riemann-Hilbert problem is meromorphic 
(e.g. with 2N poles) then the transformation (105) corresponds to an Wfold 
Backlund transformation [41] adding N poles to the solution 'P{t,x) of the 
linear system. For a general V{t, x) it is, unfortunately, practically impossible 
to find an element g{w) € such that the corresponding Z^{t,x) is 

meromorphic. One can, however, investigate what are the conditions on a 
meromorphic Z^(t,x) in order that Vg(t,x) determined by (105) is again a 
solution of the LSP. This method yields algebraic equations for the residues 
of Z^{t,x). This is, in fact, the procedure of Belinskii and Zakharov [19] 
employed in the special case G = SL{2). For this special case Cosgrove [45] 
has shown the equivalence of this method with the Backlund transformations 
introduced independently by Harrison resp. Kramer and Neugebauer [16,17]. 

For the full description of the 4-dimensional solution we need also the 
conformal factor A related to P{x) resp. M{x) through (76). The question 
arises how A changes with the transformations P ^ Pg. In [46] Julia showed 
that A transforms under infinitesimal transformations like a Lie algebra cocy- 
cle defining the central extension of the affine Lie algebra. In [21] this action 
was ‘exponentiated’ and it was demonstrated that there is a central extension 
of g|°°^ defined through a group 2-cocycle Q [47] acting on the pairs 
{V, A) considered again as ‘triangular’ elements of this extended group G^\ 
The multiplication law for this central extension of G^°°^ using the group 
2-cocycle f2 is given by 

(a, e“) o (6, e^) = (o6, . (109) 

There is a corresponding extension of the transformation law (99) to 
(p,A-i)^ (/i(p, 5 ),l)o(iP,A-i)o( 5 ,e^)-i for (g, e'^) S G^^^ (HO) 



300 



Dieter Maison 



Similar to the construction of A4 from V we can define 

((P, o (P, A-i) = (tW, (111) 

where the last expression is obtained using (110). Above we concluded that Ai 
was ^-independent due to the invariance of d'PV~^ under The natural 

extension of this invariance to QcT'^ the Lie algebra of is 

d{V, A-i) o {V, = {dVV-^,0) (112) 

since Q = —Q. 

As a consequence of (112) we find dji = 0, i.e. not only Ai but the whole 
pair {Ai, fi) is a;-independent. This remarkable fact provides us with a nice 
explicit formula for A as a function of V 

= (113) 



(where the constant c can be determined from the asymptotic behaviour for 
\x\ oo). Making use of the properties of the 2-cocycle 17 it is possible to 
show that this expression for A indeed satisfies (76). 

Next we shall demonstrate the factorization of Ai (w) in a simple example 
yielding the Schwarzschild solution. Let us take G = SL{2) and put 



Ai{w) 




Factorizing 

w - c= - ti){i + ti) with ti 
w + c = - t 2 ){j + t 2 ) with t 2 






(114) 



- c)2 + p2) 

^{z + cy + p'^'j 

(115) 



allows to separate the poles of Ai{t,x) in the complex t-plane and to obtain 



P{t,x) 



( /t2~ t-tl Q \ 

V *-‘2 I 

0 I 

\ y ^2 t—ti J 



(116) 



The corresponding solution of the vacuum field equation is 




•0 = 0 , 



. _ (1 + tit2)^ 

- ii + ti)ii + 4) 



(117) 



which is nothing but the Schwarzschild solution of mass to = c in Weyl’s 
canonical coordinates. These expressions simplify to 



A = 



A = 



u + c 



y 2 _ ^2 y 2 



(118) 
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using prolate spherical coordinates {u, v) 

z = uv, p= V(u^-c2)(l-u2), (119) 

Hence the Schwarzschild solution can be regarded as some kind of soli- 
ton obtained through a Backlund transformation from the trivial solution 
A4{w) = 1 using the element 



g{w) 




( 120 ) 



of the Geroch group. The Kerr solution with mass m and angular momentum 
J = am is obtained replacing Ai{w) of eq.(114) by 



M(w) 




( (w + m)^ + 

a (w 




( 121 ) 



where = wi} — a}. Performing again the now rather more complicated 
factorisation of M{w) one obtains = ^^°~^ ) 






Z\ 



-C^tl 

t2—t\ t2 — ti W *1*2 + 1/ 



*2 



— + (au)^ 



( 1-C^ 




/ t2+tl 


2 / \2i 

\ / *i* 2 -n 




u2 — (cu)2 


V 2 


) 




V‘l‘2 + 1 J 








1 *1 


-C^t2 , 


(^tit2-lY 






*2— *1 


*2 — *1 ' 


tlt2 + l J 


V? — 


m2 -1- (au)^ 


/ t2- 


CH 


f 1 ( 


/■ *1*2 — 1 \ 


{u + 


m)2 -1- (au)2 ’ 


V ‘2 


— *1 




^ tlt2 + l J 








(1 


1 ^A2/-*1*2 — 1 
w s; s tit2+i 




2Jv 


/ i2- 


C2t 




A *1*2 — 1 


{u + 


to)2 -I- (au)2 


V ‘2 


— *1 


) ' 1 


^ tlt2 + l J 







( 122 ) 



By repeated application of such group transformations one can obtain ‘Multi- 
Kerr’ solutions first constructed by Kramer and Neugebauer [17] with the help 
of Backlund transformations. Unfortunately none of these seem to be free of 
unphysical singularities [48] . 

In addition to the transformations of the Geroch group we can define a 
non-linear realization of the conformal group of the complex plane (resp. the 
Riemann sphere) on the solutions. For any holomorphic function f{w) leaving 
the point at infinity fixed we may put A4f(w) = [42]. Again 

there corresponds a transformation V —>■ Vf obtained through factorization 
of A4 / . The infinitesimal form (Virasoro algebra) of these transformations on 
the solutions of the Ernst equation was introduced in [49]. More recently Julia 
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and Nicolai [50] discussed an extension of the group to the semi-direct 
product with this conformal group acting on a triple (Y(t, x),P(t, x), A(x}), 
where V (t, x) is the generating function for a set of potentials generalizing 
the function l/w{t,x). 

Finally let us remark that the ‘Inverse Scattering Method’ has also been 
successfully applied to solutions with two space-like Killing vectors, describing 
gravitational plane waves [22]. 



5 Stationary Black Holes 

The remaining part of this article is devoted to applications of the 3-d a- 
model structure found in the preceding sections. We will concentrate on sta- 
tionary space-times which are asymptotically flat (or possibly asymptotically 
NUT). In addition we require that the matter fields tend asymptotically to 
the ‘classical vacuum’ configuration, i.e. the scalar fields tend to constants 
and the vector field strengths to zero. Furthermore we shall assume that the 
space-times considered fulfil the standard causality requirements (compare 
[51], p. 323 for a precise formulation). 

A space-time manifold is said to be stationary if it admits a Killing vector 
field K which is time-like near infinity and strictly stationary if K is every- 
where time-like. Scalar and vector fields living on a stationary space-time are 
called stationary, if their Lie derivative with respect to K vanishes. 

A stationary space-time is said to be static, if it is invariant under time- 
reflections. This implies that the Killing vector field K is hypersurface or- 
thogonal, i.e. the twist vector vanishes. 

In order to define staticity for the scalar and vector fields it is necessary 
to assign to each field a time reflection parity, such that the action (36) is 
time reflection invariant. This assignment is in general not unique. Consider, 
e.g., the Einstein-Maxwell system: the electromagnetic field Fa^ is usually 
considered as temporal vector field strength i.e. the magnetic field 

Ba = *FapK^ vanishes for a static solution. Due to the duality invariance 
of the field equations F^p could as well be considered as temporal pseudo 
vector field strength F^~^> with the consequence that a solution would be 
called static if the electric field = Fa^K^ vanishes. Actually we prefer 
to define staticity in terms of the scalar potentials A, which we split into 
temporal scalars and pseudoscalars Analogously we proceed with 
the scalars of G/F[. In view of the cr-model structure we will however make 
the additional assumption, that the truncation of G/F[ to temporal scalars 
is a consistent truncation of G/H as defined in Appendix A leading to a 
truncated cr-model Ggt/Flst- 

Let us now turn to the study of stationary solutions of the class of grav- 
itational models introduced in the previous chapter. According to a result 
of Lichnerowicz [52] there are no non-trivial globally regular stationary solu- 
tions of the vacuum Einstein equations - a result that can be generalized to 
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the class of theories considered here [12]. From the view-point of harmonic 
maps this is not an unexpected phenomenon [24]. This means that interest- 
ing stationary solutions have to have singularities. In case the singularities 
are hidden behind event horizons we are dealing with black holes, otherwise 
the singularities are called naked. From the physical view-point our interest 
clearly centers around the first alternative - stationary black holes. In the 
following we shall use the cr-model structure to extend in a simple way a 
number of classical existence and uniqueness results for these solutions. 

The uniqueness theorems on static black holes are usually expressed as 
‘No-Hair Theorems’ [26] stating that these black holes are characterized 
uniquely by their ‘gauge’ charges - comprising besides the mass only elec- 
tric resp. magnetic charges, but no ‘scalar’ charges or charges referring to 
global invariances. These ‘gauge’ charges can be read off from the asymptotic 
behaviour of the corresponding gauge fields and are related to Gauss-type 
conservation laws. Hence we have to discuss the asymptotic conditions for 
stationary black holes in terms of the reduced 3-dimensional formulation. 
For the Einstein-Maxwell theory this question has been thoroughly investi- 
gated by Simon [53]. As mentioned by Simon his results can be extended 
to a larger class of theories including those considered here. Accordingly we 
require the following asymptotic behaviour at space-like infinity (holding in 
a suitable coordinate system) of the fields defined on 

hab = ^ab + 0{-) ( 123 ) 

r 

M =Mo + ^Mi + 0(^) (124) 

where Mg, corresponds to the ‘vacuum’ solution. The behaviour of the indi- 
vidual fields (after a suitable choice of gauge) parametrizing M is 

A=l-^+0(^) (125) 

^ =^ + 0 {^) ( 126 ) 

A=9a + o{^) (127) 

M=l-?^ + 0(l) (128) 

where m is the total mass, n is the NUT-charge, Qa a vector of ‘electric’ 
and ‘magnetic’ charges and Q a hermitian matrix of scalar charges. In order 
to have an asymptotically Minkowskian geometry the NUT-charge has to 
vanish. Thus solutions with non-vanishing NUT-charge are imphysical, nev- 
ertheless they appear naturally in our 3-dimensional formalism (their non- 
trivial topology becomes only visible if one reconstructs the 4-dimensional 
space-time!). 
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Next we turn to the event horizon. According to Hawking’s strong rigidity 
theorem [51] the event horizon of a stationary black hole is a Killing horizon, 
i.e. its null generator L coincides with a Killing vector field. Although there 
has been some criticism concerning the strong technical assumptions going 
into its proof [54], we shall assume the validity of this theorem here. Now, 
either L coincides with the KV K for time-translations or there must exist a 
second independent one AT', which is then necessarily space-like near infinity. 
As argued by Hawking it generates axial rotations and correspondingly such 
horizons are called rotating. Since its orbits are closed, causality requires that 
the axial Killing vector K' is space- like on and outside the horizon, apart from 
the rotation axis where it vanishes. 

For non-rotating horizons the exterior domain is either static or not. 
Hawking [51] showed for the vacuum case that strict stationarity of the 
exterior domain implies staticity. The generalization of this result to elec- 
tromagnetism has been achieved only recently by Sudarsky and Wald [55] . In 
contrast to our treatment of stationary solutions based on the 3-d orbit space 
these authors prefer a ‘Hamiltonian’ treatment using fields restricted to a 3- 
d (maximal) hyper-surface. There is no doubt that the ‘Staticity Theorem’ 
holds also for the class of models considered here and it should be possible 
to extend the proof of [55]. 

Regardless of whether the horizon is a Killing horizon or not it follows 
that on the horizon the energy-momentum tensor satisfies [39] 

= 0 . (129) 

This implies that 

• The scalar fields are constant along the null generators 

= 0 (130) 

• The electric and magnetic fields satisfy 

= 0 (131) 

= 0 . (132) 

If L coincides on the horizon with a KV then the surface gravity k defined 
by 

« = ( 133 ) 

is constant on each connected component of the horizon. Similarly the vector 
potentials A are constant there; note however that the scalar fields ^ have to 
be only finite but not necessarily constant on the horizon. 

Let us now consider black holes which are the strictly stationary in their 
exterior domain, which includes in particular the static ones and switch to 
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the 3-dimensional formulation. The horizon occurs at the boundary of S 3 
where A vanishes. The surface gravity of the horizon is given by 

= -h‘^^daAdbA (134) 

4 

which vanishes only for a degenerate horizon. Moreover, being orthogonal 
to the horizon, K satisfies there the hypersurface orthogonality condition 
Kia-pK^] = 0. This implies that u>m vanishes and ip is constant on each 
connected component of the horizon. In addition the geometry determined 
by the physical 3-metric A~^hmn should remain regular as one approaches 
the boundary 0. If k yf 0, i.e. for a non-degenerate horizon, this implies 
that A vanishes as the geodesic distance (in the scaled 3-metric hmn) from 
the boundary and that (in suitable coordinates) the same holds true for the 
induced 2-metric on the surfaces A = const. In terms of the physical 3- 
metric the boundary is a totally geodesic 2-surface [57] and each connected 
component has the topology of a 2-sphere [58,59]. 

The boundary conditions in the stationary axisymmetric case are more 
complicated because using the adapted coordinates p and z introduced in 
Sect. 3.4 we must consider the axis as well as the horizon. They are essentially 
the same as those discussed by Carter [39] in the Einstein-Maxwell case. The 
2-dimensional space is the open half-plane S 2 = {(z,p): p > 0}. Both the 
horizon and (the exterior part of) the rotation axis are represented by the 
the z-axis {p = 0). Each connected component of the horizon corresponds to 
a segment of the z-axis with Z\' < 0 whereas Z\' = 0 on the rotation axis. 

The boundary conditions at infinity (r = \J p^ — > 00 ) are now 

A =1-hO(-) (135) 

V 

M = Mo -f -h O(^) . (136) 

The corresponding behaviour of M' is more complicated due to the non- 
trivial asymptotic behaviour of the axial Killing vector field and involves the 
total angular momentum J. For vanishing NUT-charge one gets 



A! 






A' 




2J(3p^z + 2z3) I ^ 

\T ) 




M' = M . 



(137) 

(138) 

(139) 

(140) 



The conditions at the horizon T~L are simply that M' has to attain a finite 
limit with Z\' < 0 there (distinguishing the horizon from the rotation axis 
where A' vanishes. 
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At the rotation axis Z\' = p^O(l) and M' tends to a finite limit quadrat- 
ically with p. The behaviour of ip' and can be determined using eqs.(75a- 



with some function a{z) and the signs refering to the upper and lower part 
of the axis. 

5.1 Spherically Symmetric Solutions 

A very important class of solutions of the theories considered in the previ- 
ous section are the stationary, spherically symmetric ones. According to the 
Birkhoff Theorem spherically symmetric solutions of the vacuum Einstein 
resp. the EM equations have a further Killing vector and thus are automati- 
cally static if the latter is time-like. But as soon as scalar fields are included 
this theorem doesn’t hold anymore. Usually one assumes that the orbits of the 
isometry group SO(3) in four dimensions are 2-spheres, implying a vanishing 
twist potential ip [51]. Since we will however use the 3-dimensional reduced 
theory, it would be unnecessarily restrictive to make this assumption. Hence 
we make this assumption on the orbits only for the 3-dimensional theory 
manifold S 3 . Using polar coordinates with some arbitrary radial coordinate 
T the metric can then be parametrized as 



Substitution of this ansatz into the action (48) resp. (63) and integrating over 
the angles yields an effective 1-dimensional Lagrangean 



where primes denote derivatives with respect to r. Varying with respect to 
N yields 



75c): 



— SQa + o,{z)p^ -\- O(p^) 

= ±4J T QA^Y~^a{z)p^ + O(p^) 



(141) 

(142) 



ds^ = hrandx"' dx" = N'^dv'^ + f{T)'^{dd'^ + sin^ddifi'^). (143) 




(144) 




(145) 



whereas the variation of (p gives 




With the gauge choice N = p this is just the equation for a geodesic of the 
target space with t as the affine parameter. Putting 7ij(^)^^^ = 
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one can integrate eq.(145) for to obtain 

/^ = sinh(rir)“^ for > 0 (147) 

/2 = _t,2sin(r;r)-2 for < 0 (148) 

f = for = 0 . (149) 

Since for asymptotically flat solutions A turns to 1 at infinity the function 
tends to where r is the geometrical radius of the 2-spheres. Hence 
infinity corresponds to r = 0. Thus each of the geodesics corresponding to a 
solution with the required asymptotic behaviour has to pass through 0o = 
(Z\(O),'0(O),H(O),^(O)) = (1,0,0, 1) for r = 0 and is uniquely determined by 
its tangent vector 4>'{0) there. Using the fact that we are studying geodesics 
of a coset space represented by group elements M the geodesics obviously 
correspond to 1-parameter subgroups. In terms of M the geodesic equation 
is simply 

(150) 

dr \ dr J 

From the asymptotic behaviour we find = —2Q where Q is the 

matrix of global charges lying in the Lie algebra part of G/H. Thus the 
geodesics have the form 

M = . (151) 

Parametrizing the matrix Q of charges one can in principle get explicit ex- 
pressions for general spherically symmetric black holes, although in practice 
this may turn out quite cumbersome. In the following we shall demonstrate 
this on a not completely trivial example, the case of the original Kaluza-Klein 
theory yielding the coset space SL(3)/SO(2, 1). 

Using the methods described in the Appendix one finds that M has the 
parametrization 

/ Z\ — A;j,A + — A/i -I- A~"^-ij) \ 

M = ^-5 -^lA + A-^i)A -^i + A-^AA A~^a\ (152) 

V A~^^ A~^A J 



with Ip = i/j + ^AA. a plays the role of a gravitational potential, A resp. 
A are the electric resp. magnetic potential, /i is the scalar field and ip the 
twist-potential . 

In this case Q is an element of sl{3) satisfying 



Q = —tQ = D with D = \ —1 

This suggests the ansatz 



Q = 



1 



- q 

2s 



(153) 



—2m — s 

q 

n 



P 



n 

-P 

2m — s 



(154) 
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where m is the mass, s the scalar charge, q the electric and p the magnetic 
charge and n the NUT-parameter. Since we are only interested in asymptot- 
ically Minkowskian solutions we put n = 0. 

For black holes A has to vanish at the horizon approached for r — ?► — oo 
and the other fields have to stay finite there [12]. This is only possible if the 
characteristic equation of Q has the form [56] 

i(TrQ2)Q = o (155) 

where 

TrQ2 = i(4m2 -k 3s^ - - p^) = 2v^ . (156) 

From (155) we get the additional relation 



det Q 



^ 2s(4m^ — s^) + (2m — s)q^ — (2m + s)p^^ 



= 0 



(157) 



Solving for q^ and p^ we find 






(2m + s) ^(2m + s)^ — 



4m 



P^ = 



(2m — s) ^(2m — s)^ — 4z;^^ 



4m 



(158) 

In order to simplify the discussion we shall study only some special cases in 
more detail. For > 0 we put p = 0, i.e. we restrict ourselves to strictly 
static solutions lying in GL(2)/S'0(1, 1). 

Exponentiating the matrix 



, / —2m — s — q 0 \ 

Q = - q 2s 0 

0 0 2m- sj 



(159) 



with the additional relations 2v = 2m — s and q^ = 2s(2m + s) derived from 
(156,157) we get 

M = = M„(l - + (C0sh2,r-1) ^,^ 

\ V / 

and hence 

= exp— 2r;r (161) 

2 1 

= —{2m + s — 2sexp2vT) (162) 

2v 

fj.3 A = -^(l — exp2vT) . (163) 

2v 

We can rewrite this in the form 

^3 = cosh^ ^ — sinh^ ^exp2wT (164) 

= cosh^sinh^(l — exp2z;r) (165) 

(1 

with cosh^ i . (166) 

(1 ~ 2ml 
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This way of writing the solution displays it as a G'L(2)-transformed version 
of the Schwarzschild solution given by 



in S'0(1,1), the stability group of the point Mq. As we will show in the 
following this possibility to obtain the most general static black hole from 
the Schwarzschild solution is in fact a rather general phenomenon. 

For = 0 we are not allowed to put p = 0, otherwise we would vio- 
late the regularity of the solution on the horizon. On the other hand the 
exponentiation of Q becomes particularly simple, since = 0: 



A = exp 2vt , /i = 1 , A = 0 



(167) 



with the transformation matrix 




(168) 



M = Mo{l - 2 tQ + 2t^Q'^) 



(169) 



resulting in 



A 




2 



(170) 




(171) 



A = - 




(172) 



A = - 




(173) 




(174) 



where l^l = \/ 2 m(l+^) 2 , \p\ = y/2m{l 



S 

2m 



) 2 and a = rriT. The behaviour 



at the horizon r = — oo is 




( 175 ) 
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A 

A 



M.o(i 



(1 



bl 

V2q 

kl 

\p\ ^ 2m’ 



2m 



)-5 +0 



sqp 

2m\qp\ 






O - 



1 
T 

1 

r ^ 

o( ^ 



(176) 

(177) 

(178) 

(179) 



characteristic for a degenerate event horizon. Since = 0 the three dimen- 
sional space is flat for this type of solution. The solution has no strictly 
static limit {p = 0), because we have to require pq ^ 0. 

All the solutions with < 0 have naked singularities [56] , hence there are 
no black hole solutions in that case. This remains so also for H > 5, where the 
behaviour of the geodesics is quite analogous. For light-like geodesics = 0) 
the condition p ■ q ^ 0 is replaced by more general conditions to be derived 
in the following. 

After this instructive example let us return to the general case for which 
we obtain the following equations for the geodesics describing spherically 
symmetric black holes 



(180) 

(181) 

+ Zi-H(A^M)pr) = 0 . (182) 



From the asymptotic behaviour of the geodesics we learn how to fix the 
arbitrary constants obtained by integrating these equations. 

A-^M^ = -Qa (183) 

^ - A-^A^M^ = 2m (184) 

(185) 

From eq.(150) we get 



^ 1 ,2 1 , 1 dM - , dM . ^ , dA"^ - u,^ ^2 

-{A~^—f + - M-i — , Af-i— - = 2v^ 

^ 4 dr dr dr 



dr 



^dA 

dr 



(186) 



Putting T = 0 in this equation we obtain the quadratic relation 



Q) - 1,QaQa = 



(187) 



Recently this ‘Quadratic Mass Formula’ has been generalized by Heusler [60] 
to arbitrary strictly stationary single black holes of the EM theory and a 
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more general model iirvolviirg a dilatoir and an axion. A very simple proof for 
all the models coirsidered here is giveir receirtly in [28] . 

Although it seems quite impossible to extract the poteirtials in closed 
form from (151) iir the general case it is possible to study the behaviour of 
the geodesics qualitatively. We are particularly interested in the asymptotic 
behaviour when r — ^ — oo, where we approach the horizon. We have to recall 
the boundary conditions on the horizon 

A{—oo) = 0 , M{—oo) = M-h < oo , A(— oo) = < oo . (188) 

These boundary conditions imply in particular that must vanish in the 
limit T — ^ — oo and we obtain the following relations between the charges and 
boundary values at the horizon: 

2m - ^Qa = liin = 2m« , Q = -^(A^QDpr ■ (189) 

T-»--oo dr c 



There are two essentially different situations: 

• > 0. This is only possible for v'^ > 0, because the negative contribu- 

tion in eq.(186) vanishes for Z\ — >■ 0. Obviously A behaves like 

A ^ for T -A — oo . (190) 



Consideriirg A aird M as functions of A one finds A = A-f^ + 0{A) and 
M = M-u + 0{A). The quantity Airm-u is the surface gravity k times the 
area of the horizon. 

• m-u = 0 describes solutions with a degeirerate horizoir. Eqs.(180) together 
with the boundary coirditions eq.(188) show that this is only possible for 
v'^ = 0. The asymptotic behaviour is, in this case, 






2 

QJM-^Qa 



-2 



for r —oo 



(191) 



with A = A-^ + 0{'/A) and M = M-u + 0{A). In addition to the relations 
eq.(189) there is a series of constraints on M. The coefficients Ck of the 
expansioir M = M-u exp^^j^ CkA^ must satisfy the equations 

k{k -\- Q A)Ck — Q aQ^pv = r.h.s. fc = 1 , 2, . . . 

(192) 

where the r.h.s. depeirds only oir Ci with I < k and vairishes for k = 1. 
This is air inhomogeneous linear system for the Ck- For k large enough 
there will be no solutions of the homogeneous system and we can solve 
the equations recursively. For small k (and in particular for fc = 1) we 
must, however, look for solutions of the homogeneous system. A very 
simple example of this situation is provided by the extremal solution of 
the cr-model with G/H = SL(3)/SO(2, 1) and G/H = GL{1) discussed 
above. 
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We may summarize the results of this section stating that stationary 
spherically symmetric black hole solutions are characterized by a time-like or 
light-like geodesic segment \n G/H starting at the point Mq (corresponding 
to r = oo) and running to the boundary Z\ = 0. Due to extra constraints on 
the scalar charges Q in general not every time-like geodesic leads to a black 
hole solution. 

5.2 Uniqueness Theorems for Static Black Holes 

In the following we consider static (as defined above), non-degenerate single 
black hole solutions of eqs.(49a-49e). The term single means that the hori- 
zon of the black hole is connected. The claim is that these solutions can 
be completely determined in the present setting - being just the spherically 
symmetric ones already described above. For pure gravity (no vectors, no 
scalars) according to a classical result of Israel [57] a static non-degenerate 
single non-degenerate black hole is necessarily spherically symmetric and 
thus given by the Schwarzschild solution. Israel’s argument was later im- 
proved and simplified by Robinson [61] and Bunting and Masood-ul-Alam 
[62] showed that the connectedness of the horizon need not be required, but 
follows. This can be interpreted as the impossibilty to bring several black 
holes into static equilibrium. Israel was also able to generalize his theorem 
to the Einstein-Maxwell theory [63]. In this case his proof consists of two 
steps. First he shows that the electric potential A is a function of the gravi- 
tational potential A and then he proceeds similarly to the case of pure grav- 
ity. Note however, that there are static superpositions of degenerate charged 
black holes, the Papetrou-Majumdar solutions [64]. Their uniqueness is un- 
doubted, but not yet completely established [65]. There exists a simplified 
version of Israel’s proof due to Bunting [66]. Making use of the fact that the 
matrix of conserved gauge invariant currents Jm = of the corre- 

sponding 50(2, l)/SO(l, 1) fj-model has a component in the Lie algebra of 
H = 50(1, 1), namely = A~^[AdmA+ (1 — Z\ — A^)dm,A], with vanishing 
‘charge’ J^dE^ = 0 he is able to derive that the current has to 
vanish identically, which allows him to express A as a function of A. Em- 
ploying an adaption of the argument of Robinson permits him to reach the 
the desired conclusion. It is an interesting problem to generalize Bunting’s 
method to the type of theories including scalars considered here. Another 
even simpler possibility could be to generalize Israel’s original method using 
the so-called Bach tensor directly making use of eqs.(64). We shall, however, 
proceed somewhat differently [12] using an argument put forward in [67] 
(compare also [68]). The idea is to transform the vector (electrical) charges 
to zero by a suitable Harrison transformation and then to show that the vec- 
tor fields vanish identically if they carry no charges. In a second step one 
concludes that, once the vector fields vanish, also the scalar fields have to be 
trivial, i.e. constant, which may be considered as a kind of scalar ‘No Hair 
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Theorem’. This way one displays any charged single static (non-degenerate) 
black hole as a Harrison transform of the Schwarzschild solution. 

The first step of this strategy is to prove the ‘No Charges - no Vectors 
Theorem’ [12]: 

For a static single black hole solution of eqs.(49a-49e) with vanshing vector 
charges Qa = ^ Ir=oo dmAdS'"’’’ = 0 the potentials A vanish identically. 
Proof: From eq.(49a-49e) we find 

{A-^A^Md’^A),^ = A-^d^^A^MdruA (193) 

and hence through integration 



/ A-^A'-MdmAdE’^ = / A-^d’^A^MdmAdE 
'h Jsi 



(194) 



(The surface term at infinity vanishes, since the integrand is 0{r ^).) 
On the other hand 






A-^A^Md^AdE’^ = A^ 



A-^MdmAdE’^ 



/H 



= -Al 



A-^MdmAdE'^ = 0 (195) 



due to the boundary conditions on the horizon % and the vanishing of Qa- 
Since M is a positive matrix we get daA = 0 as claimed and thus H = 0 
(being normalized to 0 at infinity). 

The second step is to prove the ‘Scalar No Hair Theorem ’[12]: 

If for a static single black hole solution of eqs.(49a-49e) the vector poten- 
tials A vanish, then the scalar fields are constant. 

Proof For vanishing vector potentials A the scalar field equation in (49a-49e) 
reduces to 

= 0 (196) 

Integrating we get 

0 = ^ / M-^dmMdE’^ = 0 (197) 

J r—oo 



since the contribution at the horizon vanishes because dE^ vanishes like A 
(compare 40). 

On the other hand we can multiply eq.(196) by M and obtain 



{d”‘M)-rn = d’^MM-^dmM 
and again by integration 

f d’^MM-^dmMdE = 



(198) 



r—oo 



dmMdE’^ = 0 



(199) 
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From the positivity of M we deduce dmM = 0 and therefore M = Mg = 1- 
Remark: As can be seen from the proof the assumption of staticity can be 
replaced by the weaker assumption of strict stationarity in this theorem. 
Remark: For non- vanishing vector charges Qa we obtain from eq.(49a-49e) 

i^M-^d'^M + A-^-(^Ad^A^M^ J = 0 (200) 

yielding a relation between the Qa’s and the scalar charges Q 

Q = -liAnQl)pr. (201) 

Combining the two theorems with Israel’s theorem for pure gravity we obtain 
the ‘Generalized Israel Theorem’: 

Any static single black hole solution of eqs.(49a-49e) with a non-degenerate 
horizon is a member of the ‘Schwarzschild family’, i.e. can be obtained from 
the Schwarzschild solution through a Harrison transformation and hence is 
necessarily spherically symmetric. 

As already mentioned the strategy to prove this theorem is to transform 
an arbitrary single static black hole solution of eqs.(49a-49e) via a suitable 
Harrison transformation into one with vanishing vector charge Qa and thus 
vanishing A and constant M . In [12] it was claimed that a suitable Harrison 
transformation doing the job exists without giving a detailed proof. This will 
gap will be hopefully fixed in the final version of [28]. 

5.3 Stationary, Axially Symmetric Black Holes 

The prototype of a stationary, axisymmetric black hole is the Kerr solution, 
depending on two parameters, mass and angular momentum. As proved by 
Robinson [69], this is, in vacuum, the only such single black hole solution. 
Also for the Einstein-Maxwell theory the questions of existence and unique- 
ness have a simple, positive answer in the form of the Kerr-Newman solution, 
depending on four parameters - mass, angular momentum, electric and mag- 
netic charge. The uniqueness of the Kerr-Newman solution has been proven 
by Mazur [27] and Bunting [66]. From our experience with the static solutions 
we expect the uniqueness problem for our class of more general theories to be 
more delicate. In fact, in the static case the charges Q for the scalars cannot 
be given freely, but are determined by the vector charges Qa- On the other 
hand we were able to show that all the static single black holes are members 
of the ‘Schwarzschild family’, i.e. could be generated from the Schwarzschild 
solution via a suitable Harrison transformation acting on M . It is tempting 
to speculate that this situation prevails in the case of rotating black holes, if 
we replace Schwarzschild by Kerr. 

In order to prove the uniqueness of the solutions with given c = mn (‘ir- 
reducible mass’ or size of the horizon), angular momentum J and charges Qa 
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we shall generalize the the uniqueness theorem of Mazur [27]. As already ob- 
served by Mazur himself [70] his theorem can in fact be rather easily extended 
to theories of the type considered here, the essential ingredient being the 3- 
dimensional a-model structure for axisymmetric solutions. In contrast to the 
uniqueness proofs for static black holes using the G/H tr-model the Mazur 
theorem uses the G/H' cr-model based on the matrix M' instead of M. This 
is necessary because only G/H' has positive definite metric as required in 
the proof of the theorem. Moreover this formulation avoids the singularities 
of the matrix M at the ergosurface. 

As discussed in Sect. 3.4 we can choose canonical coordinates p and z 
such that the horizon meets the rotation axis at the two points 2 = ±c. The 
horizon itself is then given by 77 = {p = 0,1-zj < c}. The exterior region of the 
black hole is the union of the two pieces of the rotation axis {p = 0, | 2 | > c} 
with E 2 = {(z,p) : p > 0}. 

The uniqueness of single stationary rotating black holes is based on the 
‘Generalized Mazur Theorem’: 

Two solutions M' = (P/)+P/ {i = 1, 2) of the equation (p(M')-i5™M');„ = 
0 taking their values in the Riemannian symmetric space G/H' and obeying 
the same boundary conditions for single black holes (i.e. equal values for c, 
J and Qa) are identical. 

Since the proof of this theorem is astonishingly simple in the present 
formalism we shall present it here. Putting and using 

(pj™).^ = 0 and J+ = we find 

i (pTra™((Ml)-iM')) ^ = i (pTr M^(J^ - ^ 1 ™))) ^ 

= pTr - JD+M'(J 2 ^ - Ji„)) = pTr{J+mJil) (202) 

with J 12 = p 2('^2 — Integrating this identity over IJ 2 and using 

Tr Ji = 0 we obtain 

/ Tr{J+^J///)pdpdz 

JE 2 

= f ^-Tr - JD) pdU^ 

= [ ^Tr {{{P^{P[rY - Pi{P2y") JS) pdSm . (203) 

J a E 2 ^ 

In order to evaluate the boundary term we use the boundary conditions 
discussed in the beginning of this section. There is no contribution from the 
horizon due to the explicit factor p. At infinity one finds 72 (^ 1 )”^ = l + 77(~) 
and the normal component of J 12 is O(^); hence the boundary term at 
infinity vanishes for any two solutions. In order to analyze the contribution 
from the axis we need the detailed form of the boundary conditions (compare 
eq.(141). For two solutions with the same angular momentum J and vector 
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charges Qa it follows that both ^* 2 (^ 1 )”^ '^12 stay finite at the axis. 

Hence the boundary term at the axis vanishes as well. This implies J7i2 = 0, 
i.e., M 2 ) = 0 leading to M[ = since {M[)~^M '2 — >■ 1 for r — 00 . 

Although the parameters m, J, QaQ and Q determine the black hole 
uniquely they cannot be chosen arbitrarily as the discussion of the spherically 
symmetric black holes has already shown. In fact, it appears that the scalar 
charges Q are completely fixed through the regularity requirements once the 
other parameters have been chosen. 

Similarly to the case of static black holes a possible strategy for a unique- 
ness proof could be to transform the vector charges to zero by a Harrison 
transformation. The problem with that is that the Harrison transformation 
acts on M, based on the reduction according to the stationary KV K. This 
formulation involving A becomes singular at the ergo-surface lying in the 
interior of the domain S 2 - The problem now is that we do not know, if the 
transformed solution still fulfils all the regularity requirements for a black 
hole (Z\' < 0 and boundary conditions for p = 0). In principle this could be 
checked, since the action on M' can be obtained using the Geroch group. 
If this step is successfully done, we can proceed as in the static case and 
conclude that the transformed solution solves the vacuum problem and thus 
is the Kerr solution. Reversing the argument shows that it is possible (un- 
der the provisions made above) to obtain the most general single black hole 
solution from the Kerr solution through a suitable Harrison transformation, 
establishing a ‘Generalized Robinson Theorem’. 
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7 Non-linear cr-Models and Symmetric Spaces 

In this appendix we shall study group theoretical aspects of non-linear a- 
models whith a target space ^ which is a non-compact Riemannian or pseudo- 
Riemannian symmetric space G/ H . 

7.1 Non-compact Riemannian Symmetric Spaces 

Let G be a non-compact real form of some compact Lie group. There is an 
involutive automorphism r: G — >■ G, = 1 such that 

H=^h&G-. r(h) = h} 



(A.204) 
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is the maximal compact subgroup of G and the coset space G/H is a non- 
compact Riemannian symmetric space [11]. In order to parametrize the coset 
space G/H we can choose a group element P as representative for each coset, 
i.e. introduce fields P{x) 



P: P{x) e G . (A.205) 

Customarily, G/H denotes the space of left cosets gH. For purely historical 
reasons we use the space of right cosets which is sometimes denoted by H\G. 
As these two spaces are isomorphic we prefer the notation G/H. The choice 
eq. (A.205) of representatives is obviously not unique and the freedom to 
choose representatives leads to a gauge invariance with gauge group H , in 
addition to the group action of G on G/H 

P{x) ^ h{x)P{x)g~^ , h{x) G H , gGG. (A. 206) 

We can eliminate the gauge group H by choosing one standard represen- 
tative for each coset. We will obtain a particularly simple parametrization of 
G/H if we choose a solvable subgroup (represented by matrices ‘triangular’ 
in the sense of the Iwasawa decomposition [11]) which intersects each coset 
once. Given such a gauge choice (or any other one) the action (A. 206) of a 
g G G will lead to non-standard representatives and must therefore be ac- 
compagnied by an induced gauge transformation h{P{x),g) G H (depending 
in general non-linearly on P and g) in order to maintain the gauge choice 

P{x) ^ h{P{x),g)P{x)g~^ , gGG. (A. 207) 

In order to construct an invariant metric on G/H we consider the 1-form 
dP{x)P{x)~^ with value in the Lie algebra Q of G and its decomposition 

dPP-^ =A + J = {Aa + Ja)dx^ , 

The transformation laws (induced by eq.(A.206)) for A and J are 

A = \{dP P-^ + r{dP P-^)) hAh-^ +dhh~^ (A.209) 
J = ^{dPP-^ -T{dPP-^)) ^ hjh~\ (A.210) 

i.e. A can be interpreted as connection for H whereas transforms H- 
covariantly and both are G-invariant. Given any invariant scalar product 
(•, •) on Q we can define an invariant metric 7 on G/i7 by 

= ■ (A.211) 

If G is simple any such (•, •) is a (positive) multiple of the Killing metric and 
for each faithful representation p: Q ^ J ^ j = p{J) there is a (positive) 
constant c such that 






(A.212) 
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In case G consists of several simple factors one gets corresponding constants c. 
The situation remains essentially unchanged if G contains one abelian factor 
but becomes more complicated in the presence of several such factors {U{1) 
or 50(1,1) -R). 

Note that the scalar product (•, •) on Q is indefinite but the metric 7 on 
G/H IS, (positive) definite. 

Given an r-dimensional faithful representation p of t/ we can project any 
r X r matrix T onto Q,T ^ (^’)pr, such that for every J € Q 

((f)p„:r) = cTr(fi) (A.213) 

and thus (J7)pr = J. This projection satisfies (pTp^^jpr = g{T)pr9~^ for all 
g € G [28]. 

The automorphism t provides us with a canonical embedding of G / H in 
G [71] 

P ^ M = t{P~^)P , t{M) = M~^ (A.214) 

where M is iJ-invariant and transforms covariantly under G 

M^T{g)Mg-\ g€G. (A.215) 

The corresponding current J = ^M~^dM is related to by 

J = DPP-^ = ^P M~^ dM P-^ = PJ P-^ (A.216) 

where DP = dP — AP is the i7-covariant derivative of P. 

The line element eq.(A.211) on G/H can be reexpressed in terms of M 

d<p^d(p^'fij{(j)) = ^{M~^ dM,M~^ dM) (A.217) 

Using this expression or the one of eq.(A.211) for the metric in the cr-model 
action eq.(12) leads to field equations in the form of conservation equations 
for the currents 



= 0 or J“ = 0 or (AT^ 9“M).„ = 0 (A.218) 

where DaJp = — [Aa, There are dim G currents J and conservation 

equations, but clearly only dim G/i7 of them are independent. In fact, the 
currents J obey the identity r(J) = due to t{M) = M~^ . 

We will choose the basis for a representation p such that r{g) = (p~'’)“^ 
for all p e G and thus t{J) = — J7+ for all J ^Q. In such a basis the matrix 
M = p{M) = P+P is hermitian and positive definite. 

Note that M and P are very close analogues of the metric and the moving 
frame (tetrad) in general relativity. We see that the ‘metric’ M is sufficient 
to formulate the cr-model and this remains true if we include vector fields. 
Nevertheless the ‘moving frame’ P with a ‘triangular’ gauge choice yields a 
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very convenient and simple parametrization of G / H . The situation changes 
if we add fermion fields to some of our models (e.g. N = \ supergravity) 
because these fermion fields transform with some representation of H and 
not of G. The ‘moving frame’ P is, therefore, necessary in order to describe 
these fermions. 

7.2 Pseudo-Riemannian Symmetric Spaces 

The situation becomes slightly more complicated if we consider a pseudo- 
Riemannian symmetric space G/H [72]. In this case we have a non-compact 
Lie group G and an involutive automorphism r which defines the subgroup 
H of G as before. In addition there is a different involutive automorphism r' 
commuting with r (rr' = t't = t") which determines the maximal compact 
subgroup H' of G 

R' = |/i G G-. r'(h) = . (A.219) 

In contrast to the case before the parametrisation with ‘triangular’ P’s is 
now no more possible globally due to the fact that some compact generators 
are remaining in G/ H . Thus this parametrisation will be singular for certain 
points (e.g. there may be regular points of G/H with Z\ = 0). The basis 
for a representation p will be chosen such that r{g) = X with a 

hermitian matrix X. If we define M = Xp{M) = P+XP the matrix M will 
again be hermitian. Due to the non-compactness of H the matrix X and thus 
M and the metric 7 will, however, not be positive definite. 

7.3 Consistent Truncations 

Given a non-linear cr-model with target space <P and metric 7 we may be 
interested to study a truncated cr-model with target space ^ C ^ and the 
induced metric 7 = 7 ],^. 

We say that a non-linear cr-model with target space is a consistent 
truncation of another cr-model with target space ^ if C ^ and if every 
solution of the field equations for the 'P cr-model is a solution of the field 
equations for the ^ cr-model as well. 

This is the case iff is a totally geodesic subspace of 
If ^ is a Riemannian symmetric space G/H then every totally geodesic 
subspace of ^ is again a Riemannian symmetric space G/H [11]. The auto- 
morphism r maps the subgroup G G G onto itself, f = t\q and therefore 
H C\ G = H G H . Similar statements hold true if is a pseudo-Riemannian 
symmetric space. 

8 Structure of the Lie Algebra 

We shall now describe in some detail the structure of the Lie algebras of the 
groups G corresponding to the theories discussed in this work [28]. Due to 
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Table 1. List of all symmetric spaces obtained by dimensional reduction from four 
to three dimensions of theories with scalars and vectors (reproduced from Table 2 
in [12]). 



# 


G/H 


G/H 


dimO/R 


k 


1 


SL{n + 2)/SO(n,2) 


GL{n)/SO{n) 


n{n + 1) 
2 


n 


2 


SU{p + l,q + 1) 


U{p,q)/{U{p) X U(q)) 


2pq 


p + q 




S{U{p,l) X [7(1, g) 


3 


SO{p + 2,q + 2) 


SO{p,q) ..50(2,1) 


pq + 2 


p + q 


SO{p,2) X SO{2,q) 


SO{p) X SO{q) ' 50(2) 


4 


SO*{2n + 4//U{n,2) 


50* (2n) SU{2) 

U(n) ^ SU{2) 


n{n — 1) 


2n 


5 


Sp{2n -1- 2; R) /U (n, 1) 


5p(2n;R)/[7(n) 


n(n -I- 1) 


n 


6 


G2(+2) 


5[7(1, l)/[/(l) 


2 


2 


^[/(l,!) X ^[/(l,!) 






7 


A4(+4) 


5p(6;R)/f/(3) 


12 


7 


5p(6;R) X SU{1,1) 


8 


Eei+e) / Sp{8; R) 


5L(6)/50(6) 


20 


10 


9 


Eq(-2) 


5[/(3,3) 


18 


10 


SU{3,3) X ^[/(l,!) 


5(f/(3) X f/(3)) 


10 


Ee(-i4) 

^©‘(lO) X SO{2) 


5[7(5, l)/[/(5) 


10 


10 


11 


A7(+7)/S[/(4,4) 


50(6,6) 
50(6) X 50(6) 


36 


16 


12 


E7(-5) 


50* (12)/[/(6) 


30 


16 


SO*{12) X S'0(2,l) 


13 


Er(-25) 


50(10,2) 


20 


16 


A6(_14) X SO{2) 


50(10) X 50(2) 


14 


Eri+s)/ SO* (16) 


A7(+7)/5[/(8) 


70 


28 


15 


Es(-24) 


-E'7(_25) 


54 


28 


X S[7(l, 1) 


Ae(-78) X 50(2) 



the possibilities to use a space-like resp. a time-like Killing vector for the 
reduction from 4 to 3 dimensions the coset spaces G/H are characterized by 
two commuting involutive automorphisms r and r' : All elements of G that are 
invariant under r form the subgroup H, all elements invariant under t' form 
the subgroup H' , and all elements invariant under tt' form the subgroup 
G ® SL{2). The restriction of r or r' to G is the original automorphism f ; 
the restriction to SL(2) is the automorphism defining the maximal compact 
subgroup SO{2). We may assume that the representation p of G is faithful, 
i.e., that all scalars of the 4-dimensional G/H cr-model couple to vector fields. 
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The group G will then be simple, and Table 1 reproduction of Table 2 in [12]) 
lists all possible cases. We choose a basis ti for the Lie algebra sl{2), where 
= e, Iq = d and t- = k satisfy the commutation relations 



In addition we have the generators Si of g and additional generators hi, 
ai, i = 1, ... ,2k for Harrison resp. gauge transformations, which must form 
doublets under SL{2) in order to reproduce the scale transformations eq.(58). 



and with a 2 fc-dimensional real matrix representation p oi g 
p : Si I— >■ p{si) = Ri , [si, h ■ a] = h - Ria , [si, a ■ a] = a ■ Ria . (B.223) 
The remaining commutators can be determined from the Jacobi identities 



The explicit form of the matrices appearing in these commutation relations 
can be found in [28] for the various coset spaces of Table 1. 
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1 Introduction 

In this article we want to describe what is known about time independent 
spacetimes from a global point of view. The physical situations we want 
to treat are isolated bodies at rest or in uniform rotation in an otherwise 
empty universe. In such cases one expects the gravitational field to have 
no “independent degrees of freedom”. Very loosely speaking, the spacetime 
geometry should be uniquely determined by the matter content of the model 
under consideration. In a similar way, for a given matter model (such as 
that of a perfect fluid) , there should be a one-to-one correspondence between 
Newtonian solutions and general relativistic ones. 

The plan of this paper is as follows. In Sect. 2.1 we collect the information 
afforded on one hand by the Killing equation obeyed by the vector held gen- 
erating the stationary isometry and, on the other hand, that by the Einstein 
held equations. Throughout this paper we assume this stationary isometry 
to be everywhere timelike. Thus ergoregions are excluded. We try as much 
as possible to write the resulting equations in terms of objects intrinsic to 
the space (henceforth simply called “the quotient space”) obtained by quo- 
tienting spacetime by the action of the stationary isometry. Much of this is 
standard. But since none of the references known to us meets our specific 
purposes, we give a self-contained treatment starting from scratch. Since all 
the models we treat are axially symmetric, we add in Sect. 2.2 a second, axial 
Killing vector to the formalism of Sect. 2.1. 

In Sect. 2.3 we first introduce new dependent variables for the vacuum 
gravitational held, namely a conformally rescaled metric on the quotient space 
and two potentials, using which the held equations have an interpretation 
in terms of harmonic maps from the quotient space into the Poincare half 
plane. These potentials, originally due to Hansen, are used in our treatment 
of asymptotics in Sects. 3.1,2. We then formulate the boundary conditions at 
spatial infinity appropriate for isolated systems and prove two basic theorems 
due to Lichnerowicz on stationary solutions obeying these conditions. These 
theorems are manifestations of the above-mentioned principle concerning the 
lack of gravitational degrees of freedom. The first result, the “staticity theo- 
rem” , basically states that the gravitational held is static when the matter is 

B.G. Schmidt (Ed.): LNP 540, pp. 325-372, 1999. 
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non-rotating. The second one, the “vacuum theorem”, states that spacetime 
is Minkowski when no matter is present. Iir Sect. 2.4 we restore c, the velocity 
of light, in the field equations aird show that these teird to the Newtoirian 
ones as c — >■ oo . 

In Chap. 3 we we study solutions only “near infiirity”. (Note that by 
the Lichnerowicz vacuum theorem, such solutions can not be extended to 
all of R'* exept for flat spacetime.) In the Newtonian case such solutions 
are known to have a convergent expansion in negative powers of the radius 
where the coefficients are given by multipole moments. The relativistic situ- 
ation is slightly at variance with our statement at the beginning concerniirg 
the Newton-Einstein correspoirdence: namely, there are irow, corresponding 
to the presence of two potentials rather thair oire, two infinite sequences of 
multipole moments, the “mass momeirts” which have a Newtonian analogue 
and the “angular momeirtum momeirts” which do not. One may now study 
the two potentials and the rescaled quotient space metric in increasing pow- 
ers of 1 /r, where r is the radius corresponding to a specific coordinate gauge 
on the quotient space which has to be readjusted at each order in 1/r. 

The results one finds are sufficient for the existence of a chart in the one- 
point “compactification” of the quotient space (i.e. the union of the quotient 
space and the point-at-infinity) , in terms of which yet another conformal 
rescaling of the 3-metric, together with a corresponding rescaling of the two 
potentials, admit regular extensions to the compactified space. As summa- 
rized in Sect. 3.2 one is then able to find field equations for these “unphysi- 
cal” variables which are regular at the point-at-infinity and in addition can be 
turned into an elliptic system. From this it follows that the unphysical quanti- 
ties are in fact analytic near infinity and this, in turn, implies convergence for 
a suitable 1 /r-expansion (r being a “physical” radius) for the original phys- 
ical variables. Furthermore the structure of the unphysical equations yields 
the result that the (physical) spacetime metric is uniquely characterized by 
the two sets of multipole moments. 

It is remarkable that stationary vacuum solutions satisfying rather weak 
fall-off conditions at spatial iirfiirity, by the very nature of the field equations, 
have to have a convergent multipole expansioir. We believe that the topic of 
far-held behaviour of time-independent gravitatioiral helds is by now reasoir- 
ably well understood. The main opeir problem is to characterize an a priori 
giveir sequence of multipole moments for which the expansion converges. 

Iir Chap. 4 we review global rotatiirg solutions. In Sect. 4.1 we outline 
a result due to Lindblom which shows that stationary rotating spacetimes 
with a one-component huid source with phenomenological heat conduction 
and viscosity have to be axisymmetric. In Sect. 4.2 we describe a theorem of 
Heilig which proves the existence of axisymmetric, rigidly rotating perfect- 
huid spacetimes with polytropic equation of state, provided the parameters 
are sufficiently close to ones for a nonrotatiirg Newtoiriair solution. In Sect. 4.3 
we present the solution of Neugebauer and Meinel representiirg a rigidly rotat- 



Time-Independent Gravitational Fields 



327 



ing infinitely thin disk of dust. In the final chapter we treat global nonrotating 
solutions. 

In Sect. 5.1 we outline the essentials of a relativistic theory of static elastic 
bodies. The remaining sections are devoted to spherical symmetry. It has 
long been conjectured that nonrotating perfect fluids are spherical whence 
Schwarzschild in their exterior region. In Sect. 5.2 we discuss the present 
status of this conjecture. A proof exists when the allowed equations of state 
are limited by a certain inequality. While this inequality covers many cases 
of physical interest, the Newtonian situation suggests that the conjecture 
is probably true without this restriction. In Sect. 5.3 we review spherically 
symmetric perfect fluid solutions. The final Sect. 5.4 gives a short description 
of self-gravitating Vlasov matter in the sperically symmetric case. 

In the subject of time-independent gravitational field of isolated bodies 
there are some topics we do not cover. We do not address the question of 
the conjectured non-existence of solutions with more than one body. (Muller 
zum Hagen [58] has some results on this in the static case.) Furthermore 
we limit ourselves to “standard matter” sources. Thus Black Holes are ex- 
cluded. (For this see the article of Maison in this volume.) We also could 
not cover the interesting case of soliton-like solutions for “non-linear matter 
sources”, starting with the discovery of the Bartnik-McKinnon solutions of 
the Einstein-Yang Mills system (see Bizon [11].) 
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2 Field Equations 

2.1 Generalities 

Let (M, 5 ^[^) be a 4-dimensional smooth connected manifold with Lorentz 
metric of signature ( — I- ++). We assume M to be chronological, i.e. 
to admit no closed timelike curves. Let be an everywhere timelike Killing 
vector field with complete orbits. Thus we do not allow points where turns 
null, i.e. we exclude horizons and ergospheres. It follows (see [26]) that the 
quotient of M by the isometry group generated by is a Hausdorff manifold 
N and that M is a principal R^-bundle over N. Furthermore this bundle is 
trivial, i.e. M is diffeomorphic to x N . The fact that this diffeomorphism 
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is non-natural (whereas of course the projection tt mapping M onto N is) 
plays a role in the formalism we shall now develop. 

Let us introduce the differential geometric machinery necessary for writing 
the stationary Einstein equations in a way naturally adapted to As far 
as possible, we will be interested in quantities and equations intrinsic to N 
( “dimensional reduction” ) . For a similar treatment see the Appendix of [23] . 
We define the fields V and uj\u\ = by 

^ := ^ E < 0 (2.1) 

■= ( 2 . 2 ) 

The 3-form w^j^a vanishes if and only if is hypersurface orthogonal - in 
which case (M, g^,^) is called static. More important than will be the 
2-form given by 

(2.3) 

Given the fields cr^jy and w^j^a carry the same information, since 

= 3E“^^[^tT,,A]- (2.4) 

Equation (2.4) is obtained by expanding the identity = Oj which 

follows from (2.2), and contracting with In a similar way we obtain the 
relations 



(2.5) 

uJp^x<y‘'^ = ( 2 . 6 ) 

We now invoke the Killing equation for i.e. 

^i9pv = 4- = 0. (2.7) 

Expanding uj^^x in terms of we easily see that 

Vp^. = E-i[ap, + (V[pK)C,]], (2.8) 

or, equivalently, 

= (2.9) 

In the static case we have = 0, whence there exist global cross sections 
given by t = const, where = VS/ 

Equation (2.9) implies that 

V[p(E-V,.A]) = 0. (2.10) 

Clearly we have C^t = 0, where r is the 3-form given by t^^a = V~^ojfj_,^x- 
By (2.10) and the identity -|- d(fjr), this implies = 0. Since 



Time-Independent Gravitational Fields 



329 



dr is a 4-form and is nowhere zero, we infer in 4 dimensions that dr is 
zero, i.e. 

V[^.{V-^u;,Xp]) = 0 . ( 2 . 11 ) 

Equations (2.10,11) are integrability conditions for the Killing equations (2.7) 
which are “purely geometric” in that they do not involve the Ricci (whence: 
energy-momentum) tensor. Now recall the relation 



= -R^Xp^^p, 


(2.12) 


which follows from (2.7) and its corollary 




g^'^y.^p^p = -Rp''^.- 


(2.13) 


From (2.2), (2.7) and (2.13) we find that 




'^^i^pux = 2^\^Rx]pi^, 


(2.14) 



which, using (2.8), implies 

V'^(y-V^,) = (2.15) 



where we have also used (2.6,7). Interpreting = Rf^^, — \gpuR as the 

energy-momentum tensor of matter, the r.h. side of (2.14) is zero iff the 
matter current, for an observer at rest relative to is zero. In that case, 
and provided that M is simply connected, there exists a scalar field w, called 
twist potential, such that 






(2.16) 



and then (2.11) implies 

= 0. (2.17) 

Note that, by virtue of = 0, w satisfies = 0. 

Next, using the definition (2.1) and eq. (2.8), it is straightforward to show 
that 



= -2Rpx.p^^e + 2V-^[a^xrr.^ - (V^K)^(^a,);, 

+ (2.18) 

and whence 



+ V~\VVf + 2V~^af,^a>^r (2.19) 

Now recall (see e.g. the Appendix of [23]) that there is a 1 — 1 correspondence 
between tensor fields on M with vanishing Lie derivative with respect to 
and such that all their contractions with and are zero - and ones 
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of the same type on N. In the case of covariant tensor fields on N, this 
correspondence is the same as pull-back under tt. Examples of such tensor 
fields on M are the scalar field V, the symmetric tensor field 

( 2 . 20 ) 

and the 2- form . Note that can also be written as 

= ( 2 . 21 ) 

The tensor is, of course, the natural Riemannian metric on N . The co- 
variant derivative associated with acting, say, on a covector living 
on TV, is given by 

D^X, = h/h/v^,X,,. ( 2 . 22 ) 

Denoting by the curvature associated with we find, using (2.9), 

that 



^/iivAcr ^(7 i>' X' a' ^^v^Xp ^ ^^X[p^y]p ^p\p^u]}^' 

(2.23) 

Since N is 3-dimensional, there holds 

<yp[,y<yxp\ = 0, (2.24) 

so that 

R/ij/Ap = hx^ hpP Rp'u'X'p' + 3V~^ap^axp. (2.25) 

Thus 

Rp. = h/h/Rp.,, - V-^Rp,,xp'C'e' + W~^apxa,\ (2.26) 
Using (2.18), (2.26) finally leads to 

Kp. = h/h/Rp>,, + ^V~^DpD,V + 2V~^apxcJ,^ - \v~^{DpV){D,V). 

(2.27) 

From (2.19) we deduce that 

D^V := h^^''DpD^V = + ]^V~^{DVf + 2V~^ap^af^'' . (2.28) 

We now make the following observation: when t^,..a is an arbitrary tensor on 
TV, there holds 

h/ ■ . . hx^'X^Tp,p..x> = i-V)-^^^D^[{-Vy/^Tp,...x]. (2.29) 

Applying (2.29) to (2.15) it follows that 



(2.30) 
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Finally, projecting (2.10) down to TV, it follows that 

= 0. (2.31) 

Given the spacetime with the Killing vector under the con- 

ditions stated at the beginning of this section, there are coordinates (t,a;*) 
on M, such that the canonical projection tt takes the form tt : (t,a;*) i— >■ (x®), 
with X® local coordinates on TV and such that the Killing vector takes the 
form ^ = d/dt. In terms of such coordinates tensor fields on TV, say 
can be viewed as the tensor fields 

= (5 ^® . . .5^^n,„j{x). (2.32) 

Since = V, there holds 

= V{dt + ipidx^), (2.33) 

for some 1-form ipi. Note that, in the tangent space at each point (T, x®) € M, 
the g^j,-orthogonal complement of is spanned by (fidjdt + 9/9x® and the 
orthogonal complement of in the cotangent space is spanned by <Tx®. From 
the definition it follows that 

g^i.dx^dx®' = V{dt + Lpidx'‘Y + hijdx'‘dx\ (2.34) 

where V, (pi, hij on the r.h. side of (2.34) are all independent of t. It is now 

straightforward to check that 

cr^iydx^dx®' = ^^x])dx^ dx'' = V^d[i(pj]dx'‘dx^ . (2.35) 

Thus tTpi,, viewed as a tensor on TV, is given by 

aij = V'^d[iPj]. (2.36) 

In the static case t can be chosen so that pi = Q. 

Conversely, let us start from the 3-manifold (TV, hij, V,aij) with Rieman- 
nian metric hij, a negative scalar field V and the 2-form cr^, subject to 

^[z(^-'^,fe]) = 0, (2.37) 

which corresponds to (2.31). Suppose, moreover, that TV has trivial second 
cohomology. Then there exists a covector pi on TV with 

aij = V'^D[,PjY (2.38) 

Define M = {t G R}xTV and define on TV the Lorentz metric g^^i, by (2.34) and 

by ^ = d/dt. Then one checks that = V, that, under the projection 
7T : AT — >• TV, hfj_i, is the pull-back of hij and that is the pull-back of 
aij = V'^D^iPj-^. The fact that the product structure of TVf as TVf = x TV is 
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not natural is reflected in the above construction by the fact that ipi, solving 
(2.38), is given only up to tpi ^ (pi = + DiF, with F a scalar held on 

N. Under this change given (2.34) remains unchanged only when we set 
t = t — F. 

Given the fields (hy , V, Uij) on N , we can define the fields r, rj, by the 
following equations: 

= -2r + ]^V-^{DVf + 2V~^aija^^ (2.39) 

Z5*[(_F)-i/V,,] = {-Vy/\ (2.40) 

+ \v~^D,D,V + 2U-3a,fefr/' - \v~^{D,V){DjV). 

(2.41) 

It then follows from our previous considerations that the spacetime (M, g^i,) 
satisfies 



Rfj,udx^dx'^ = r{dt + ipidx^)^ + 2ridx\dt + pidx^) + Vijdx^dxF (2.42) 



In particular, iff r, r,, are all zero, is a vacuum spacetime. In this 

case we refer to (2.39,40,41) as ‘the vacuum equations’. 

For later use we record another form of the held equations 

(2.43) 



where 



Tf^^dx^dx’' = r{dt + ipidx'^Y + ‘^Ti{dt + (pjdx^)dx'' + Tijdx^dx^ , (2.44) 

and where we set 

g^vdx^dx'^ = —e^^{dt + ipidx^)'^ + e~"^^hijdx^dx^, (2-45) 

given by 

D^U = T + f/) — uJijU)^^ (2-46) 

= ne-^^Tj (2.47) 

Rij = 2{D,U){DjU) - 2e*^u)tkU)j'^ + Coktd)^^ + i^{Tij - hjf/)- 

(2.48) 



Here 



(jJij — UJij 



and indices are raised with . 






(2.49) 
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2.2 Axial Symmetry 

We now assume the existence of a second, spacelike Killing vector 77 ^ on 
{M,gfj_u). There is the following identity 

4S/f^{r]ipUj^^x]) = + ^S.^Rp.[A\-np\, (2.50) 

where uj^vx is given by (2.2) and we have used (2.14). Suppose, in addition, 
that ^ and 77 commute. Then the first term on the right in (2.50) vanishes so 
that 

4V'^(^[p^7..A]) = (2.51) 

In an analogous manner 

= -6r7'^i?p[.6^p], (2.52) 

where uj'^^x is given in terms of 77 in the same way as ujpuX is given in terms 
of The r.h. sides of (2.51,52) are zero (at points where ^ and 77 are linearly 
independent) iff the timelike 2 -plane spanned by ^ and 77 is invariant under 
Rp!^ ■ These conditions will be satisfied when the energy momentum tensor 
is that of a rotating perfect fluid. We now assume that 77 ^ has an axis, i.e. 
vanishes on a timelike 2-surface which is tangent to . Then, and when the 
r.h. sides of (2.51,52) are zero, it follows that 

V[p^puX] — ^[p^pvx] ~ (2.53) 

The relations (2.53), in turn, are nothing but the conditions for the 2-plane 
elements orthogonal to ^ and 77 to be integrable (“surface transitivity of ^ 
and 77 ”). The above result is due to Kundt and Triimper [38]. 

For the purposes of Sect. 4.2 we need to transcribe the relations satisfied 
by 77 ^ on the quotient manifold N . Writing the 1-form rip = gp^r]'^ as 

rjpdx^ = g{dt + (pidx^) + gidx'", (2.54) 

so that 

the Killing equations 

V^dxgpv + ‘^gx(pd„)g^ = 0 (2.56) 



gW.V = 0 (2.57) 

2w„-77^' = D,{V-^ri) (2.58) 

= 0, (2.59) 



are equivalent to 
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where := The surface transitivity conditions (2.53) get translated 

into 



■niiDjink] = 0 (2.60) 

= 0. (2.61) 

In particular, rf is a hypersurface-orthogonal Killing vector on (TV, hij). Sup- 
pose, now, that the energy momentum tensor is that of a rigidly rotating 
perfect fluid, i.e. 

= {p + p)Ufj.u„ + (2.62) 

with 

u^l = + 17 = const, (2.63) 

and / is chosen so that is future-pointing and = —1. With this 

specialization the quantities r, r^, Tij entering in the held equation (2.46,47,48) 
become 



T = p{p + p){-e^^ + QriY - (2-64) 

Ti = + nri){p + p)firii (2.65) 

Tij = pe~'^^hij + fn‘^{p + p)pi7]j, ( 2 . 66 ) 

where rji = hijpP The normalization factor / is given by 

/ = [e-2^(-e2^ + Qpf - (2.67) 



The held equations have to be supplemented by the Killing relations (2.57,58,59). 
Note that these imply that p and p are invariant under 77® (in addition of 
course to being invariant under d/dt). Under these circumstances the con- 
tracted Bianchi identities, which imply that 

= 0 , ( 2 . 68 ) 

boil down to the relation 

{p + p)f-^DJ = D,p, (2.69) 

the remaining condition, namely Tj) = 0, being identically satisfled. 

2.3 Asymptotic Flatness: Lichnerowicz Theorems 

Before stating the conditions for stationary spacetimes to be asymptotically 
flat, we elaborate somewhat more on the vacuum held equations. First recall 
from (2.17) that, when M (or equivalently: N) is simply connected, there 
exists a held ui on N such that 



(2.70) 
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where we have used (2.16) and 

Eijkdx'' dx^ dx^ = {—Vy^^^^£fiv\adx‘'dx^dx'^. (2-71) 



(Of course, the existence of uj could have also been inferred from (2.40) for 
Xi = 0.) We now rewrite the vacuum equations in terms of the conformally 
rescaled metric hij (see (2.45)), given by 

hj = {-V)h,j. (2.72) 

Then (2.39,40,41), together with (2.70) lead to 

D'^V = V~^{DVf -V~^{Dujf (2.73) 

D^lo = 2V-\Dlo){DV) (2.74) 

and 

Rb = lv-^mV){D,V) + {D,u;){D,u;)l (2.75) 

or 

!^{D,V){D^V) + {D,iu){D^cu) - ^hj[{DV)^ + {Duj)^]j . 

(2.76) 

We can now give an interesting geometric interpretation of the vacuum equa- 
tions (2.73,74,76). Namely, let V be the Poincare half-plane with metric qab 
given by 

QABdz^dz^ = V~^{dV^ + dco^) (V > 0, — oo < uj < oo). (2.77) 



Viewing {z^ (x) , z'^ {x)) = (V(a;), w(a;)) as a map from (N,hij) to {V^qab), 
one easily checks that (2.73,74) are exactly the conditions in order for this 
map to be harmonic, in other words 

D^z^ + = 0. (2-78) 

where denotes the Christoffel symbols of Qab, composed with z^{x). 
The metric hij(x), of course, is not given, but has to satisfy (2.51). The r.h. 
side of (2.51), in turn, is nothing but the energy momentum tensor of the 
harmonic map. {V,qAB) can also be viewed as a spacelike hyperboloid in 
(2-|-l)-dimensional Minkowski space. Namely, define fields 



= 






■ U!'^ — 1 



-W 



<Ps = - 



2V’ 



d>K = - 



-k -k 1 
4y ^ 



(2.79) 



Then 

+ (2.80) 

Viewing (^ic,^Mj^s) as coordinates on with Lorentz metric A{—dF\ + 
d<P\j + d'P'g), the induced metric under the map (2.79) is nothing but qab- 
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The fields 'Pm,^s are the potentials first introduced by Hansen [25] which 
we shall use in Sects. 3.1 and 3.2. 

As with any harmonic map, we can associate a conserved current on 
N with any Killing vector on the target space V. Since V has <S'0(2, 1) as 
isometry group, there are three independent such Killing vectors, namely 



1 

V 



A 



2 

V 



A 



3 

V 



A 



d 

dz^ 

d 



A 

doj 



dz^ dV 



d 



du! 

d d 1 2 

— - = uY 1 — 

Q^A OTC ^ o 



dV 




(2.81) 

(2.82) 

(2.83) 



We note in passing that the SO(2, 1) isometry of V is closely related to the 
“Ehlers transformation” discussed in the article by Maison in this volume. 
The conserved current ji associated with any Killing vector on V is given 



by 

ji = Z^,iV^qAB- 


(2.84) 


Hence 




A = V~‘^DiUj 


(2.85) 


1 = V~^D,V + V-^uD,uj 


(2.86) 


j, = V~^ujD,V + (2K)-2(o;2 - V'^)D,uj 


(2.87) 


1 

are all divergence-free on (A^, hij). By (2.70) and (2.38), ji 


is also equal to 



ji= 



(2.88) 



1 

and so the “charge” associated with ji is always zero. In the asymptotically 
fiat we shall turn to later, (2.87) will be identically zero. 

The quantity (2.86) has the following spacetime interpretation (compare 
[23]). Let 27 be a 2-surface in M which projects down to a smooth 2-surface 
on N. Then there exist local coordinates {x^) = (t,x*) such that 27 is given 
by 

x'^(/) = (0,^XA)), A =1,2. (2.89) 

Now integrate the quantity over 27. After some computation one 

finds 

f)x^ 

= (-Vr''^{d,V (2.90) 

where, as before, <7^ = Now, using (2.70), 

-2{-V)-^/^a,jip^ + (-K)-3/2o;Aw = {iVEij^ipk). 



(2.91) 
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Thus, when S is closed, the integral I of the expression (2.91) is given by 

(hij = {—V)hij) 



1= {-V)-^{D,V + V~^ujD,uj)dS\ (2.92) 

J E 

The fact that this integral in vacuum only depends on the homology class of 
E arises, in the spacetime picture, from the fact that 

= 0, when = 0. (2.93) 

The quantity 

M = ^ I (2.94) 

on 

is called the Komar mass of For the Schwarzschild solution it coin- 

cides with the Schwarzschild mass when the “outward” orientation is chosen 
for dS\ 

We now come to the 



Boundary Conditions 

Recall that we require (M, g^u) to be connected, simply connected and chrono- 
logical. Let, in addition, M contain a compact subset K and let M\K be 
an “asymptotically flat end”. (The results of this subsection will remain to 
be true if M\K consists of finitely many asymptotic ends.) This means that 
M\K should be diffeomorphic to Mr {R > 0) with 

= {(a:°,a;*) G X (R3\B(i?))} (2.95) 

with B{R) a closed ball of radius R. In terms of this diffeomorphism, the 
metric in M \ K has to satisfy that there exists a constant C > 0 such 
that (see [3]) 

Ig/j.i'l + \g^'^\ + ~ dtJ-A + + 'r^^°^\d^dpgy,i,\ < C (2.96) 

9oo<-C~\ 9^°<-C-^ (2.97) 

V X* G R^ 9ijX^X^ > C~^ (2.98) 

We assume a > 1/2. Furthermore we require R^i, to be zero in M \ K. 
(This latter condition could be considerably relaxed.) It now follows that 
the level set = 0 is a spacelike submanifold of XI \K which has a finite 
ADM-momentum (see [3]). If is a timelike vector (which it will be 
for ‘reasonable’ matter except in the vacuum case), it now follows from the 
timelike character of that it has to be an asymptotic time translation, i.e. 



+ r^\dpd,e\ < Cr-^ 



(2.99) 
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where the constants satisfy 



< 0 ( 2 . 100 ) 

(see [3]). Furthermore it follows from [3], that in M\K, or a subset thereof 
diffeomorphic to Mj^i for sufficiently large R' > R, there are coordinates 
{t, y*) in terms of which is again asymptotically flat with the same a > 1/2 
and so that is of the form d/dx^ = d/dt. Hence, in the coordinates 



which we now call (t,a;*), the metric 

g^j^^dx^dx’' = V{dt + ipidx'^)^ + hijdx^dx^ (2.101) 

satisfies 

\V +\\+r\diV\+r^\d,djV\ < Cr~°‘ (2.102) 

\ifi\ + r\^jtp^\ + r‘^\dkdjipi\ < Cr~°" (2.103) 

\hij - I + r\dkhij\ + r'^\dkdehij\ < Cr~°‘ (2.104) 

in M\C. It follows that 

r\cFij\ + r‘^\dk(Jij\ < Cr~°‘. (2.105) 



We remark that the time coordinate t, which is at first only defined in the 
open subset M \ K oi M, can be (Kobayashi-Nomizu [36]) extended to a 
smooth global cross section of tt : M — >■ iV. 

We now state and prove two uniqueness theorems due to Lichnerowicz 
[41], which are basic for the theory of stationary solutions. 

Staticity theorem: Let be asymptotically flat with a > 1/2, 

be an asymptotic time translation. If the matter is non-rotating relative to 
i.e. Ti in (2.40) is zero, then the spacetime is static. 

Proof: From (2.40) we have 

= 0. (2.106) 

Contract (2.106) with , using It follows that 

= (-y)-5/V,,a*L (2.107) 

Now integrate (2.107) over N. Since the term in brackets on the left is 
0(r~^“^“), the boundary term at infinity gives zero. Consequently aij = 
0 => uj^^x = 0 . 

Remark: Since = 0, the field ipi is of the form ipi = DiF, where F = 
0(r^~“). In the coordinates t = t — F, g^^^, takes the form 



g^vdx^dx’^ = Vdf' + hijdx^dxF 



(2.108) 



Time-Independent Gravitational Fields 



339 



Vacuum theorem: Let (M, satisfy the conditions in the staticity theo- 

rem and let (M, in addition be vacuum. Then (M, is the Minkowski 
space. 

Proof: Firstly, by the staticity theorem, we have that cr^ = 0. Using this in 
(2.39) for r = 0 we have with v := (— that 

D'^v = 0. (2.109) 

By the maximum principle, or multiplying (2.109) by integrating by parts 
and using /r — 1 = 0(r~“), difi = 0(r~^““), we infer that /r = 1. Now (2.41) 
implies M.ij = 0 => M.ijke = 0 since dimiV = 3. Since N is simply connected, 
it follows that (N,hij) is flat R^. Thus 

Qfj^udx^dx" = —dt^ + Sijdx’’dxP ( 2 . 110 ) 

2.4 Newtonian Limit 

Ehlers showed (unpublished, see [49]) that one can write the held equation 
containing a parameter A = c~^ such that the equation remain meaningful 
for A = 0 and then they are equivalent to the Newtonian equations. The 
variables for which this is true in the time dependent case have to be chosen 
in a quite sophisticated way. The stationary case can be treated in a direct 
and simple way as follows. 

We write the metric as 

g^^dx^dx’' = —e~^ {cdt + ipidx'^Y + hikdx’‘dx^ ( 2 - 111 ) 

where we inserted “c” by dimensional analysis. The held equations decom- 
posed in section 2.1 

SttG , 1 , , , 

(2.112) 

for the energy momentum tensor 

= c^T{cdt + (fiidx^)'^ + 2cTi{cdt + (fjdx^)dx’' + Tijdx'‘dx^ (2.113) 

become 

D^U = 47rG(e"^T -k c'^f/) - (2.114) 

= SirGc-^e-^Tj (2.115) 

Rij = 2c~'^DiUDjU — 2e^ + hije^ + 87rGc“^(rij — h^f/) 

(2.116) 

Considered as equations for U,hij,4>i,T,Ti,Tij, (2.114-116) have a limit for 
c — >■ oo. 

In the static case the limit is 



D^U = 4ttGt 



(2.117) 
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i?^fe = 0 (2.118) 

Hence we obtain immediately that the metric hik of the quotient is flat and 
therefore (2.117) is the Poisson equation of Newton’s theory. The connection 
has also a limit and the only non vanishing Christoffel symbol is = D'’U. 
The equation of motion = 0 becomes in the limit the Newtonian 

equilibrium condition D^Ttj = —rDjU. 

Now to the stationary case: Because the right hand side of (2.115) vanishes 
in the Newtonian limit, the Lichnerowicz theorem implies that ujik = 0 which 
in turn implies by (2.116) that Rik = 0 whence the metric on the quotient is 
again flat. 

For the metric written in the form (1) the connection has no limit for 
c — > oo. If we use however as a consequence of the held equations that LUij = 0, 
the connection has a limit and the equations of motion become the Newtonian 
equilibrium conditions, i.e. 

DWi = 0 , D^Tij = -tDjU . (2.119) 

2.5 Existence Issues and the Newtonian Limit 

The fact that the equations can be written to contain A = c~^ in such a 
way that they are analytic in A and are the Newtonian equations for A = 0, 
suggests to use this structure for existence theory. In this section we will make 
some remarks about the static case. In section 4.2 an existence theorem for a 
rigidly rotating body by Heilig will be discussed which exploits the fact that 
the equations have a nice Newtonian limit. 

To obtain partial differential equations for which there is an existence 
theory we write (2.104) and (2.105) in the static case in harmonic coordinates 
on iV, defined by = 0, for the unknowns U and 2’d defined by = 
jb + and obtain: 



AU := S^^didjU = dyrGr + H(A, r, , Z®^) (2.120) 

= -AdkUdiU5^^5^^ - 167rG(r®^ - rJ®^) 

+ X^B^^{\T,TkuZ^\dmZ^\dmdnZ^^) ( 2 . 121 ) 

here we used the well known expression for the Ricci tensor in harmonic 
coordinates 

R®J = -^h^idkdih^^ + dh) (2.122) 

where is quadratic in the first derivatives of h®-^. As usual we call (2.120), 
(2.121) the reduced held equations. These form a quasilinear elliptic system 
with the property that for given small sources r, Tij of compact support and 
small A there exist unique solutions G, Z®-^ which tend to 0 at infinity. 

In particular we can choose for r, Tij a Newtonian solution and determine 
then for small A a relativistic solution of the reduced held equations which 
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have a Newtonian limit. Is is to be expected that the solution will be analytic 
in A. Then the Taylor expansion in A can be considered as a converging post 
Newtonian expansion. 

A solution of the reduced field equations is only solution of the field equa- 
tion if it satisfies the harmonicity condition or equivalently if = 0 

holds. To solve the reduced equations and the equation of motion is a much 
harder problem. It makes only sense once a matter model is chosen. 

In the static case only matter models of elasticity lead to new interesting 
problems because, as we will see in Sect. 5.2 , static fiuids are spherically 
symmetric and can be investigated by ordinary differential equations (see 
Sect. 5.3). Some remarks on static, small self gravitating bodies can be found 
in Sect. 5.1. 

For a stationary rigidly rotating fluid Heilig has given an existence the- 
orem by perturbing away from a Newtonian solution. We will describe this 
result in Sect. 4.2. 

3 Far Fields 

3.1 Far-Field Expansions 

While, as we have seen, little is known so far about globally regular, asymp- 
totically fiat solutions to the stationary field equations with reasonable matter 
sources, there is an almost complete understanding of the behaviour of gen- 
eral asymptotically fiat solutions near spatial infinity, which we now describe. 

Here the quotient manifold N is of the form 



On N there are given the fields {hij,V, (pi) satisfying (2.39,40,41). The whole 
discussion is “local-at-infinity” . In particular one has to allow for R in B{R) 
to be made suitably large, as one proceeds. We will do so tacitly without 
changing the letter “i?” . The Einstein equations are given by 



By the asymptotic conditions (2.102-104), lo tends to a constant at infinity. 
Subtracting this from ui, and calling the result again to, we find that 



N = R^\B{R). 



D^V = V~^{DVf - V~^{Du}f 
D'^uj = 2V~\Duj){DV) 

G,, = ^V-^{D,V){D,V) + 






(3.1) 

(3.2) 



\lo\ + r\diUj\+ A\didjio\ < Cr “. 



(3.4) 



In short, we have that 



F=-l + 0(r-“), a; = 0(r-“), = 0(r"“), 1 > a > 1/2 



(3.5) 
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and that these relations may be differentiated twice. The condition a > 1/2 
could be relaxed (see Kennefick and O Murchadha [34]). It now follows that 



AV = 0(r 
Alu = 0{r 



-2-2a 



) 



(3.6) 

(3.7) 



Since the r.h. sides of (3.6,7) decay stronger than 0{r~^), it follows from 
standard results in potential theory [33,73] that there exist constants M, S 
such that 



R=-l+ — +0(r-i-“) 
r 



UJ = 



= — + 0(r-i-“). 



(3.8) 

(3.9) 



But the existence of (pi in (2.36) implies that S has to be zero. 

Equation (3.3), which involves second derivatives of the metric hij, yields 



= 0{r 






where kij = hij — Sij, k = ka and 



(3.10) 



7 / — dj k^j ^&ik. 



(3.11) 



This equation can be viewed in two ways, both of which recur in the higher- 
order steps leading to the theorem below. Firstly, in the gauge where Ti = 0, 
i.e. the linearized harmonic gauge for hij, it is an elliptic equation, namely 
essentially the componentwise Laplace equation, for the leading-order part of 
kij. Secondly, (3.10) can be rewritten as 



^iim^jnp^i^nkmp — 0(r ) 



(3.12) 



which expresses the fact that the linearized Riemann tensor of hij decays 
faster than Note that (3.12) makes essential use of the three-dimen- 

sionality of space. It follows [73] that there exists gi = 0(r^~“) such that 



kij — diQj + djgi - 1 - 0 ( 7 ’ ^“). 



(3.13) 



Thus the leading-order contribution to the metric hij is “pure gauge”. 
To next order in 1/r one finds that there is a gauge, namely 



n = 0 {' 



, — 3 — Q: 



), 



for which 



V = -l 

2SiX^ 



2M 

r 



2MiX^ 2M^ 



0(r 



-l-2aN 



0(t’-i-2“) 



(3.14) 



(3.15) 



LO 



(3.16) 
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and for which hij can be brought into the form 



k 



ij ~ 



- XjXj) ^ 



(3.17) 



In the above M, Mi, Si are constants. All indices are lowered and raised with 
5ij. When M ^ 0 one can, by a rigid translation, arrange for Mi = 0. In that 
case the metric g^i, obtained from (3.15-17) coincides, to order 1/r^, with 
that of the Kerr spacetime with 151 = —Ma, M being the mass and a being 
the Kerr parameter. 

In order to extend the above result to higher orders in 1/r, it is convenient 
to replace {V, uj) by some other choice of scalar potentials. One choice, due 
to Hansen [25], is to set (see (2.79)) 



4>m 

4>s 

(t>K 



V^+uj^-1 

4K 

u 

W 

V^+oj^ + 1 
4K 



It then turns out that (</)„) = (c/)m, 4>s^ 4>k) all satisfy 



D'^(j)a = 2R(j)a. 



(3.18) 

(3.19) 

(3.20) 



(3.21) 



Then one has [73] the following 



Theorem: There exists a gauge, namely that where 17 = 0{r ™ ^ “), for 
which there are constants A ... ,B ....... ,G .. . such that 



m— 1 



1=0 



= 2^ 

e=o 






1 



=T+E 



-I- 

il-l 



, X T . . X 



e=o 






+ 0°°{r 



oo / — m+1 — 2a\ 



Note that E = M, 



(3.22) 

(3.23) 

(3.24) 



hij — Sij + 

i=2 



rp . nrt . - rp'^X rp'^f. — 2 



p2i 



ft B 

^21-2 









T) ■ ■ - 



X ^ . X 



p2i-2 



p2i-^ 



+0^(r 



— m+1 — 2 q;' 



) ■ 



(3.25) 
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All constants are symmetric in their i\ . . . indices. The constants D are 
also symmetric in i and j. The constants C . . . appear only for m > 3, the 
constants D . . . only for m > 4. The symbol means that the quantity 

in question is of 0{r’^), its derivative is a.s.o. Furthermore all con- 

stants are determined by the tracefree parts of E . . . , F . . . in a way which 
does not depend on the solution at hand. The tracefree parts of E . . . are the 
analogues of the Newtonian multipole moments. The constants F . . . play 
an analogous role for the “angular-momentum aspect”, which does not have 
a Newtonian counterpart. The three-metric hij, for reasons explained after 
(3.14), has no independent degrees of freedom. 

This theorem shows, in essence, that any stationary, asymptotically flat 
solution to the Einstein vacuum equations is uniquely determined by the 
“moments” E...,F.... However no statement concerning convergence of 
series like the ones appearing in (3.22-25) can be made. In order to do that 
it is necessary to use “conformal compactification” of three-space N. 



3.2 Conformal Treatment of Infinity, Multipole Moments 

Before turning to the situation G.R., it is instructive to recall the Newtonian 
situation. Suppose we are given a Newtonian potential near infinity, i.e. a 
function (j) with 

Z\<^ = 0 on R^\B{R). (3.26) 

Extending (p smoothly to all of R^, we thus have that 

A(j) = ATTp with p € C(j”(R^) (3.27) 

and (/)—>■ 0 at infinity. Thus 4> is of the form 

(j){x) = - f \ dx'. (3.28) 

7r3 \x-x'\ 

In R^ \ B{R) this can (see e.g. [33]) be expanded in a standard fashion in 
powers of 1/r. One obtains an expansion of the form 



' = E 



fi]. . 









(3.29) 



with totally symmetric and tracefree. One shows [33,73] that this series 

converges absolutely and uniformly in R^ \ B{R) for sufficiently large R. 

As a warm-up for G.R. it is useful to rephrase the Newtonian situation 
using “conformal compactification”. First observe that there is a positive 
smooth function fi on N = R^ \ B{R) with the following properties. The 
metric ^ 

h^j — f2 



(3.30) 
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extends to a smooth metric on the one-point compactification 



N = N U {r = 00} = N U {A}, 


(3.31) 


where 




17 yl = 0, Dif2\A = 0 


(3.32) 


and 




DiDjQ - 2hij = 0. 


(3.33) 


To prove this, take 17 = l/r^ and introduce 




~i x’- 


(3.34) 



as coordinates on N. One also sees that hij is again the standard flat metric 
in the coordinates P. (This would also follow from (3.33) and the standard 
formula for the behaviour of Rij under conformal rescalings.) As for the 
potential, rewrite (3.26) as 



^ = 0, (3.35) 

and observe that the operator in (3.35) obeys 

(3.36) 

when hij = f2^hij and (j) = for arbitrary 17 > 0. Thus we again have 

0 = 52 ^= 0 , (3.37) 

at first only on N. 

In the case of G.R. we were unable to prove convergence of the multipole 
series, but only an asymptotic estimate like 



™ ) F- ■ - 



(=0 



£ir^e+i 



O 



oo^y,— m-l-l — 



(3.38) 



But, from (3.38) for m = 4, it follows immediately that c/> extends to a C^- 
function on N. Thus, by continuity 

52 ^ = 0 on N. (3.39) 



But it is a standard fact that solutions to the Laplace equation and, more 
generally, for non-linear elliptic systems with analytic coefficients [55], are 
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themselves analytic. Thus (j) has a convergent Taylor expansion at the point 
A. But this is nothing but (3.29) in inverted coordinates. Furthermore the 
multipole moments can now be viewed as the Taylor coefficients of 4> 

at A. It follows from (3.39) that they have to be tracefree, and it is trivial 
that they determine (j) uniquely. 

Suppose f2 is just required to satisfy (3.32,33). Then, given hij, there is 
in {hij, f2) the following 3-parameter gauge freedom 

Q' = toil, 

^ij ~ ^ ^ij ^ 

where 

uj = {l-ED,f2 + Eb,f2)-\ (3.42) 

with DiJA = 0, which, in the compactified picture, corresponds to the freedom 
of choosing an origin in the “physical” space R^, w.r. to which the inversion 
P = x'' /r'^ can be made. Therefore the Taylor coefficients of U at A behave 
under (3.40,41) in a way which precisely corresponds to their dependence on 
the choice of origin. 

In G.R. it is impossible to require a conformal compactification for which 
(3.33) holds everywhere. We call a 3-metric hij on a manifold N = Tl^\B{R) 
conformally C^, when there exists a C^-function 17 > 0 on such that 
hij = n^hij extends to a C^-metric on TV = iV U {A} and 



(3.40) 

(3.41) 



1?U = 0, Af^U = o, 



(3.43) 



{DiDjQ 




(3.44) 



A scalar potential (j) is called conformally C^, when (j) = extends to a 

C^'-function on TV. Given (3.43,44) there is now a much larger gauge freedom 
involved in constructing the unphysical from the physical quantities, namely 



Q' = ujQ, h'ij=oj^hij, (j)' = Lo (3.45) 



where lo satisfies uj\j\ = 1. Now consider, following Geroch [22], this recur- 
sively defined set of tensor fields on TV : 



Po 


= <(> 










p^ 


= Di^ 










Pij 


= TS 


DiDjcj) - 










= TS 


Dim+lP^l 


..im 




im + 1 



(3.46) 

(3.47) 

(3.48) 

(3.49) 
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where TS denotes the operation of taking the symmetric, trace- free part. It 
turns out that the tensors 






(3.50) 



behave under (3.45) in exactly the same way as the Newtonian moments 
under the restricted gauge freedom (3.40-42) with bi = DiUj\A- Thus the Ricci 
terms in (3.46-49) cancel out unwanted dependencies from higher-than-first 
derivatives of w at A. 

Now return to the expansions (3.22-25) for some fixed m > 1. Performing, 
again, an inversion a;* = jr^ and setting, in these coordinates, 

(j)M = (j^S = (3.51) 

hij = (3.52) 

with 17 = 1/r^ we find that {(pM, 4>S, hij) are all C"". Furthermore 17 is C°° . 
Thus we have obtained a C"* conformal compactification. Our proof would 
be complete if we could find an elliptic system satisfied by {Kj,(pM,<Ps) or 
quantities derived from them. Doing this is not completely trivial. We explain 
the essentials in the static case where = 0. Thus 






(3.53) 



M.y 



1 + 40M 



(3.54) 



Let us assume that M ^ Q. Define, instead of 
factor also called 17 by 



[(-P)l/2_1]2 

(_P)l/2 



1/r^ 



as above, a conformal 
(3.55) 



It is not hard to see from (3.22-25) that this yields a C™-compactification 
{4>M,hij) where, however, we have for convenience replaced (3.44) by 







0 . 



(3.56) 



R is useful to employ, as the scalar variable in the unphysical picture neither 
(pM nor 17, but the quantity a defined by 



n-vy/^-iY 

_(-P)l/2 + l_ 



(3.57) 



After some labor we find from (3.53,54) that 



(3.58) 
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and 

— cr(l — cr)Mij = DiDjcr hijD^a. 

3 




(3.59) 


The scalar a satisfies 




crU-0, DicrU-O, 


(3.60) 


Taking a “curl” of (3.59) we obtain 




(1 ( 7 . 


(3.61) 



If we take Z?® of the quantity and use the Ricci and Bianchi identities 

we find the relation 

(3.62) 

Using that K is zero and (3.62), writing and using (3.59) to elimi- 

nate second derivatives of cr, we obtain an equation of the form 

D^Tij = non-linear terms, (3.63) 

where these non-linear terms depend at most on t^-, a and their first deriva- 
tives and on D^a. We call D^a = p. From the divergence of (3.59) we infer 
that 

PO•=^(5cr)^ (3.64) 

and from this after some work that 

D^p = 3cr(l - cr)^iyi*^' -b 3i„(5V)(5V). (3.65) 

Now (3.63) can be completed as follows: 

Rij = Tij (3.66) 

D^Tij = non-linear terms (3.67) 

= p (3.68) 

&p = 3cr(l - cr)2iyib (3.69) 

Going over to harmonic coordinates, the “non-elliptic” terms in the expres- 
sion of Mij in (3.66) in terms of the metric go away, and the whole set of (3.66- 
69) becomes an elliptic system. Note that the point of the whole manoeuvre 
was that the original eq. (3.59), when written in terms of hij is singular since 
g\a = 0. The miracle was that, in the transition from (3.59) to (3.61) a factor 
a is obtained on both sides of (3.61) which can be cancelled since cr is nonzero 
outside A by (3.57). 

Thus, taking m sufficiently large and appealing to the theorem of Morrey 
[55], we have the 
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Thmrem: When M ^ 0, there is a chart in a neighbourhood of A for which 
(a,hij) are analytic. Consequently, from (3.57), 17 is also analytic, and so is 

= (1 - 

An analogous result can be proved for a suitable set {hij^Q,4’M,4’s) in 
the stationary case [6], see also [39]. The equations one obtains imply in 
particular that the “physical” quantities (hij, (l^s) have an analytic chart 
in a neighbourhood of each point of N and thus entail the “classic” result of 
Miiller zum Hagen on the analyticity of stationary vacuum solutions [57] . 

By smoothness of {hij, (pM-, 4’s) we can define multipole moments for each 
of 4>M:4>S^ following (3.46-50). One can show [73] that they coincide with 
the quantities E . . . and F . . . in the expansions (3.22-25). (These, in turn, 
coincide with the ones in Thorne [74], as shown in [24]). One can now prove 
[6], that these moments determine the stationary solution uniquely up to 
isometries. We give a more careful formulation of this result only in the 
static case. 

Theorem: Let there be two static solutions with the same hij\j\, the same 
M 0 and the same set of (mass-centered) multipole moments. Then the 
corresponding physical solutions (hij, (I’m) are isometric. 

The proof is a not-too-difficult inductive argument based on (3.61,62), 
(3.68,69) and (3.59,60). 

There remains the question to what degree the multipole moments of 
stationary solutions can be prescribed. It is fairly easy to see, e.g. from 
the asymptotic analysis of Sect. 3.1, that the multipole moments are “al- 
gebraically independent”, i.e. for a given finite number of them, there always 
exists a spacetime having those moments which solves the stationary field 
equations to arbitrary order in 1/r. It is not known what conditions on mo- 
ments for high order have to be imposed in order for the multipole expansion 
to converge. In particular, convergence is not even known when only finitely 
many moments are non-zero. 

There are of course solution-generating techniques to in principle write 
down the general stationary axisymmetric spacetime. To date the only result 
on existence of stationary asymptotically flat solutions without any further 
symmetry is that of Reula [66] . 

We note, in passing that the above equations lend themselves to an easy 
proof of a result which is often used in black-hole uniqueness theorems (see 
[31]). Namely an asymptotically flat, static vacuum solution with M 0, 
which is spatially conformally flat, has to be isometric to the Schwarzschild 
metric near A. To see this, use that now the Cotton tensor of hij is zero. 
Thus, since K. = 0, the left-hand side of (3.61) vanishes. Contracting the r.h. 
side of (3.61) with (D'^a)W^ we find that 



2 



(3.70) 
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But, by Cauchy-Schwarz, the right-hand side of (3.70) is bounded above by 

which has hence to be zero. Since a can not have critical points near A except 
at A itself, it follows that 

i„- = 0, (3.71) 

whence, from (3.59), = 3/2M^ and thus, in a chart a;* for which hij = (5^ 

we have a = |a?p/4M^, from which it easily follows that {hij,4>M) corresponds 
to Schwarzschild with mass M. 

4 Global Rotating Solutions 

4.1 Lindblom’s Theorem 

Lindblom showed in his thesis [45] that stationary asymptotically flat dissi- 
pative fluid configuration are axisymmetric. In this section we want to outline 
and discuss this theorem. 

There are three ingrediences of the proof: 

(i) The local fluid field equations imply that the fluid flow is proportional to a 
Killing vector provided the divergence of the entropy current vanishes. 

(ii) The Killing field has an extension into the vacuum field of the solution. 

(iii) If the manifold of orbits of the stationary Killing vector is and 
asymptotically flat, then is linearly independent of t^. The two Killing 
fields commute and there is a linear combination of the two Kiliing fields 
which has fixed points near which it act like a rotation. 

(i) Theorem: Let be a stationary local solutions of the Einstein field 

equations for a one-component fluid with phenomenological heat conduction 
and viscosity laws and vanishing of the divergence of the entropy current. 
Then the fluid flow is proportional to a Killing vector. 

Proof: The energy momentum tensor for a fluid with shear and bulk viscosity 
is [54] {9 and are the expansion and shear of the fluid; is the heat flow 
[ 20 ]) 

= pu^u’' + {p- C0)h'^" - 2r,a^‘' + + q''u^^ (4.1) 

with 

= 0, = 0 . (4.2) 

This implies (a dot denotes the covariant derivative in the direction of the 
fluid flow u^) 

0 = -(V^T'^")u, = p+{p + p)6- + q^ii^ . (4.3) 
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Introducing n, the conserved rest-mass density, and the specific volume f = ^ 
and the specific internal energy m = ^ we can rewrite this, using = 

0, as 

n{u + pv) - (0^ - 2rja^''a^„ + + q^ii^ = 0 . (4.4) 

For a one-component fiuid we have an equation of state u = u{p, v) and conse- 
quently there exist scalar functions T(p, v) and s{p, v) with the interpretation 
of temperature and specific entropy such that 

du + pdv = Tds. (4.5) 

Hence n{u + pv) = nTs can be used to rewrite (4.4) as 

nTs- (6'^ - 2?7cr'^'^cr^i, -|- -I- = 0 (4.6) 

or 

ns + T-^V^q^ = r-i(C0" + - q^ii^) = 0 . (4.7) 

Using again V^(nrt^) = 0 we obtain 

V^(nsw'^ + + 277 ^ 7 '^" - q^{ii^^ + T-^V^T)] = 0 . (4.8) 

Inserting the phenomenological law of heat conduction 

q^ = -nh'' + Tii„) (4.9) 

we obtain 

V^insuf^ + T~^q>^) = T-\C0^ + 277fT'^"fT^, + K,T-\^qf^) = 0 . (4.10) 

The left-hand side of this equation is the conserved entropy current 
which vanishes according to our assumptions. Hence the positivity of A,C 
and K implies 6 = = q^^ = 0 and = — T“^T^ . 

Assume T ^ 0 and consider = T~^u^, the candidate for the Killing 
vector. We have 

ifiiv) = -T~'^V . (4.11) 

The vanishing of 0 = = q^ = 0 implies = 

T~^iy (iiT)u^), hence = 0. 

Now we come to the most complicated part, the extension of the Killing 
vector proportional to the fiuid flow from the fiuid into the surrounding vac- 
uum region. 

(ii) Conjecture: Let gfny,T^i, be a strictly stationary, asymptotically fiat 

perfect fiuid solution where the matter is a ball of finite extent and the fiuid 
flow is proportional to a Killing vector . Then has a unique extension into 
the vacuum region, provided certain differentiability conditions are satisfied 
at the boundary. 
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In Lindblom’s original treatment this conjecture was shown to be true 
under the assuption that the outside metric is analytic up to and including 
the boundary S of the fluid. Then one can propagate the Killing vector into 
a neighbourhood of the boundary using the Cauchy Kowalevskaja theorem 
because a Killing vector satisfies a wave type equation. Finally a theorem by 
Nomizu [61] can be used to obtain a global Killing vector field. 

One might, however argue, that analyticity up to and including S is too 
strong an assumption. On physical grounds one would like to treat also non- 
analytic equations of state. In this case it is unlikely that the metric is analytic 
in the boundary. 

Finally we show that the new Killing vector is actually different from 
the stationary Killing vector 

(Hi) Theorem: Under the assumption of the above conjecture we have: 

(1) The Killing vectors and are linearly independent. 

(2) Both Killing vectors commute. 

(3) There exists a linear combination + which has fixed points 

and acts like a rotation with closed orbits. 

Proof: (1) Suppose would be linearly dependent of Then there would 
be a timelike Killing vector, namely which is asymptotically a translation 
and relative to which the matter does not rotate. Hence, by the Licherowicz 
staticity theorem, spacetime would have to be static. 

(2) As T and are invariant objects we have C^T = 0 and = 0 
which imply immediately [^,t] = 0 on the support of the matter. To show 
that this is also true outside the matter one can use the analyticity of the 
outer metric up to and including the boundary or one can use a theorem by 
Beig and Chrusciel [4] classifying all possible group action on asymptotically 
flat spacetimes. 

(3) As commutes with there is a Killing vector P on the manifold 

of orbits of The corresponding group acts in the 2-surface of constant 
pressure, in particular in the boundary, p = 0. As this is topologically 5^, 
there must be a point where P vanishes. A Killing vector on a Riemannian 
space with a fixed point acts always as a rotation with closed circular orbits. 
At a point q in spacetime projecting on the fixed point of P, must be 
proportional to and therefore a linear combination r]^ = with 

constant coefficients vanishing at q exist such that rj^{q) = 0. We have a 
fixed point and because also the timelike direction of is fixed, acts like 
a rotation and has therefore closed orbits. 

We see that we can obtain the existence of the axis working only on the 
body, provided we know that both Killing vectors are independent. Lind- 
blom [43] obtains the axis and commutativity of the Killing vectors from 
the asymptotic symmetry group. The key property that the two Killing vec- 
tors are linearly independent is only implied by a global argument and uses 
asymptotic flatness. 
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4.2 Existence of Stationary Rotating Axi-symmetric 

Fluid Bodies 

Following work by LiapunofF and Poincare, Lichtenstein [42] demonstrated 
at the beginning of this century the existence of rotating fluid bodies in 
Newtonian theory. An account of this almost forgotten work can be found 
in [71]. Using implicit function theorem techniques - as we would say today 
- he shows the existence of solutions near known solutions or approximately 
known solutions: starting with a static fluid ball, he obtains a slowly rotating 
fluid ball; starting from a self gravitating 2-body point particle solution, he 
obtains a solution for two small fluid bodies orbiting their center of mass on 
a circle. Furtheremore, there is a number of exact solutions in Newtonian 
theory: the Maclaurin spheroids, the Jacobi and the Dedekind ellipsoids and 
the Riemann ellipsoids [18]. 

In Einstein’s theory we do not know any stationary exact solution de- 
scribing an extended rotating body. Spacetimes describing such solutions can 
be characterized as follows: Besides a timelike Killing vector there is a 
further symmetry, the axial symmetry generated by whose orbits are cir- 
cles (Remember that we showed in Sect. 4.1 that such an extra symmetry 
exists on physical grounds) The body is spatially compact and the space- 
time with topology is assumed to be asymptotically flat. We assume that 
there is an axis where R vanishes. Then we can use a result of Carter [16] 
which states that under these circumstances the two Killing vectors commute. 
Such spacetimes are called ’’stationary axisymmetric” , the orbits of the axial 
Killing vector are circles. 

We showed in Sect 2.2 that for stationary axisymmetric perfect fluids 
with an axis and a fluid flow vector contained in the two-surface spanned 
by the two Killing vectors, the two-surface elements orthogonal to the two- 
dimensional group orbit are surface forming ( the group action is orthogonally 
transitive). The same holds in the vacuum region. The property of orthogonal 
transitivity is equivalent to the existence of a discrete isotropy group [70]. 

To introduce a global coordinate system let us assume that outside the 
2-dimensional axis the spacetime is the product of the orbits of the isometry 
group and the orthogonal 2-surface which we assume to have topology R. 

Using coordinate adapted to the Killing vectors the metric can be written 
as 

= gAB(x'^)dx^dx^ + goo{x^)dt'^ + 2gQ^{x^)dtd^ + g^^{x^)d(j)^ . (4.12) 

Locally we can always introduce coordinates {x^) = (r, z) such that gAB 
is conformal to the flat metric in standard coordinates and can therefore write 
the metric as 

ds^ = + dz^) + - e^^(dt + Ad(f)f. (4.13) 

There is the freedom in (r, z) of an arbitrary conformal transformation 
which is given by the real part of analytic function. 
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The function is the volume element of the group orbits. As a conse- 
quence of the field equations in vacuum one can locally achieve W = r' such 
that 

ds^ = + dz''^) + e~^^r'^d(l)^ - e^^{dt + Ad(j)f . (4.14) 

These coordinates are called Weyl’s canonical coordinates. Matters can be 
arranged so that r' = 0 is the axis. Then the coordinates are fixed up to a 
translation in z' . 

It is tempting to try to extend the Weyl coordinates from the outside of 
the body to the interior such that the two-surface orthogonal to the group 
orbit is covered by one (r', z') system with r' = 0 describing the axis and 
W ^ r' ui the interior. However, Muller zum Hagen has demonstrated [56] 
that this is impossible in the case of static spherically symmetric solutions, 
(r' becomes negative inside the body and the axis is reached for p' — ?► oo.) 
There is no reason to assume that this would be different in the stationary 
case. 

Numerical codes work successfully with a global (r, z) coordinate system 
such that r = 0 is the axis but it is not assumed that one has Weyl’s canonical 
coordinate in vacuum. 

For perfect fluids whose velocity is proportional to a constant linear com- 
bination of the two Killing vectors, the case of rigid rotation, = 0 

becomes particularly simple. (See equation (2.69).) 

0 = V,V = (P + P)^(ln/-").^ +P.^. (4.15) 

where = /^(^^ -I- is the four velocity of the fluid. This shows that, 
provided an equation of state p{p) is given, the matter variables p and p can 
be expressed as functions of the quantity / which is determined by the geom- 
etry. This property of rigidly rotating fluids is essential for all the numerical 
schemes as well as for all the attempts to prove existence. 

Various authors have developed codes to calculate numerically stationary, 
axisymmetric rigidly rotating fluid solutions [12]. Today this can be done 
with very high presicion by different numerical techniques. These numerical 
solutions are also the basis for investigations of oscillations of rotating stars. 

Schaudt and Pfister [68] try to obtain an existence theorem working in 
the above coordinates adapted to the symmetry. This approach is attractive 
because the field equations become semilinear elliptic. One has, however, to 
control the singularities in the equations on the axis. This is possible and 
two Dirichlet problems have been solved, which give existence of outside, 
asymptotically flat solutions and existence of inner parts of bodies, provided 
appropriate boundary values are given [67]. Up to now this was only possible 
for the ’’outer” and the ’’inner” problem separately and work is in progress 
which tries to combine the inner and outer solution. 

Let us now turn to the discussion of the only existence theorem for ro- 
tating fluids in Einstein’s theory. It is remarkable that the first existence 
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theorem for rotating fluids, proved by Heilig in 1995 [30], uses Lichtenstein’s 
technique and does not adapt the coordinate to the axial Killing vector to 
avoid difficulties at the axis. 

Let us formulate one particular case of the theorem proved by Heilig [30] . 

Theorem: Let p{p) = Cp^ be a polytropic equation of state with 1 < 7 < 
6/5. The central density po determines a unique Newtonian static fluid ball 
solution of finite extent. Then there exist a positive constant Qq such that for 
all I? with 0 < < Qq a, stationary axisymmetric rigidly rotating solution 

with angular velocity 17 of the Einstein held equations for a perfect fluid 
exists. The solution is geodesically complete, asymptotically flat with finite 
mass and angular momentum. The matter is of finite extent and has the same 
equation of state and central density as the Newtonian solution. 

The theorem holds also for more general equations of state. It is not clear 
whether the case of positive boundary density may be treated by this method. 

Heilig uses the observation of Jurgen Ehlers [21] that it is possible to 
write the held equations as an elliptic system with a parameter X = c~^ - 
interpreted as the velocity of light - such that the equations for , A — >■ 0 give 
the Newtonian equations and the limit is regular. This can be achieved by a 
particular choice of unknowns for which the held equations are formulated. 

We will describe the structure of Heiligs proof using the equations for- 
mulated in Sect. 2.4 because these are much simpler. We want, however, to 
stress that we expect that Heilig’s result could be proved more easily using 
these equations, but this is not certain before all the functional analysis has 
been done properly. 

Let us first adapt the held equations to a rigidly rotating fluid. We write 
the axial Killing vector as in (2.54) 

rjfidx^ = r]{cdt + 4>idx^) + rpdx^ . (4.16) 

The Killing equation in spacetime is equivalent to the equations (2.57-59) on 
the quotient N. For a rigidly rotating perfect fluid with fluid flow vector 
we have 

= /(Cm + ^ = const , = -c^, (4.17) 

where 

/-2 = e"^(-e^ -k c~^nr]f - . (4.18) 

To obtain the held equation we replace in (2.64-66) 77 by c~^rj and p by c~^p 
to obtain from (2.46-48) using U — >■ c~‘^U 

D^U =4TrG[f{-e^ + c~^ npf{p + c~^p) + 2c~^pe~^] 

+ c~^e~^ pfl‘^{p + c~^p)r]ir]jJd^e~^ — e~^ LOijLO^^ (4.19) 
D^LOij = 87rGc“^e “^/^(— + c~^ [2r]){p + c~^p)firij (4.20) 
Rij = 2c~^DiU DjU — 2e^ uoikdoj + hijC^ 
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+ SttGc ^[-2pe ^ hij + + c '^p)ri^rij 

- hijff2'^{p + c~‘^p)piT]mh’''^] (4.21) 

The above field equations have to be supplemented by the Killing equations 
(2.57”2.59). The equations of motion are 

= 0 ^ (cV + p)f-^D,f = D,p (4.22) 

For c — >■ oo we have from (4.18) that /^ = 1 which implies by (4.20) that 
D'^ujij = 0. The staticity theorem now gives that = 0. Using (2.58) this 

2U 

implies Di[e r]) = 0. The vanishing of rj on the axis implies rj = 0. Using 
all this the field equations reduce to 

= 0 , D^U = A-kGp (4.23) 

Therefore the metric on N is fiat. Using Dif) = -Di{U-\Q'^pirf) 

the equation of motion become the Newtonian equation 

-pA(t/ - = D,p (4.24) 

equation {rjir]^ = + y'^ in Cartesian coordinates). 

As discussed in Sect. 2.5 for the static case, the field equations become 
again a quasilinear elliptic system for U, , pi in harmonic coordinates 
(V;_jV^t = 0 O'- Pi = 0, = 0 = 0). Namely, the condi- 

tion that the time function is harmonic turns the left-hand side of Equ.(4.20) 
into an elliptic operator acting on pi. Harmonicity of x* has the same effect 
on the left-hand side of Equ.(4.21). Theorem 4.1 of Heilig [30] can be adapted 
to show that for small A, 17 a solution of the reduced field equation exists near 
the Newtonian solution. Such a solution satisfies only the harmonicity condi- 
tion if the equation of motion holds. So, this has to be solved simultaneously. 
This is possible because given a equation of state (4.22) can be integrated 
such that the matter quantities can be expressed in terms of the geometrical 
quantity /. Therefore the following iteration procedure is well defined: begin 
with a Newtonian solution U^,p^; choose some A, 17 and use as a 

source in the field equations in harmonic coordinates to obtain p^i. 

Calculate / from , Z^'^Gp^i^X, f2 and determine p^ from the equation of 
motion. Then one solves again the field equation with the new source and 
so on. Heilig has shown that for sufficiently small A and 17 such an iteration 
converges in his variables. It should also converge in the variables used here. 

It is remarkable that we have used rjidx'^ = xdy — ydx as a given field. At 
the end one has to check that the solution is axisymmetric and satisfies the 
harmonicity condition. 

Note that only for A = c~^ with some fixed value of the velocity of light 
in some units the above field equations are Einstein’s equations. It is however 
possible to reinterpret solutions with any A as solutions of Einstein’s equation 
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expressed in different units [30]. With this interpretation the theorem above 
demonstates the existence of slowly - the theorem does not control the range 
of w - rotating fluid configurations. 

4.3 The Neugebauer Meinel Disk 

The only known global solution describing a rotating object in Einstein’s 
theory, is the relativistic analogue of the rigidly rotating Maclaurin disk in 
Newton’s theory [10]. 

An axisymmetric surface density distribution (in cylindrical coordinates 

I 7 ^ 

cr(r) = cToW 1 2 ’ 0<r<a, (4.25) 

V ^0 

generates a gravitational potential l?(r, z), which is determined by the Poisson 
integral from cr. At the disk the potential is 

1 TT^OfT^ 

<P{r,0) = -I- const , 0 < r < a , V? = • (4-26) 

2 i'q 

Outside the disk the potential can be expressed, for example, in terms of 
integrals over Bessel functions. 

The centifugal force acting on rigidly rotating particles balances the grav- 
itational force in the disk. Therefore, we can interpret the density distribu- 
tion as formed by self gravitating, rigidly rotating dust. The two parame- 
ters CTo 3'iid tq determine such disks uniquely. The total mass of the disk is 
M = |7T(To?'o- 

Neugebauer and Meinel found the relativistic analog of these disks [59]. 
There is a well known formalism available in General Relativity to de- 
scribe matter surface distributions [31]. In the particular case of a reflection 
symmetric disk, we have to find solutions of the stationary vacuum field 
equations, defined outside the disk such that the difference of the normal 
derivatives of the metric at of the disk have a certain algebraic structure [31]. 
The general stationary axisymmetric metric can be parametrized as 

[e^^(dr^ -|- dz^') -I- r'^dcj)^] — {dt + adcj))^ . (4.27) 

The metric coefficients U, k and a depend only on r, z; the vector fields 
^^d/dx^ = d/dt and rj^d/dx^ = d/d(j) are Killing vector fields. We assume 
that the orbits of the axial Killing vector are circles; r = 0 is the axis. 

Let us assume that the disk is located atz = 0, 0<r<ro.A rigidly 
rotating flow forming the disk is described by a vector field (which is defined 
at the disk) 

= e~^ ie + = (4.28) 

where I? is constant. The definition of a dust disk implies that the metric 
is continuous across the disk and that = au^u'', where cr is the surface 
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density, satisfies = 0 with respect to the Levi Civita connection of the 
metric induced on the disk. As is a Killing vector, it holds 

= 0, = 0, VfiV^ = 0 and we obtain 

v^v'^) . (4.29) 

Finally = gfi^v^v'' implies = 2g^^v>^v'' --y = 2v'^Vy-v and we see 

that V must be constant on a disk formed of rigidly rotating dust, K = Vo- 
lt is natural to introduce comoving coordinates 

t' = t; (j)' = =^>^ + , 77^' = (4.30) 



The vacuum field equations can be expressed in terms of the following quan- 
tities: 



: a' = -e , C/'(r, 2 = 0) = Vb = const 

(4.31) 



and b'{r,z) determined by 

a'j. = , a'., = — re“^^ b'^ . (4.32) 

Using the Ernst potential f = + ib' the key field equation is the semi- 

linear elliptic Ernst equation [37] 



Reinu' + fu 



+ -f[r) = f[l 



+f:i 



(4.33) 



For a solution of the Ernst equation the integrability condition of (4.32) is 
satisfied and one can solve for a' . The remaining metric coefficient k' follows 
from the equations 



u'; - uj + - b,l) 


II 


c 7 ,;c/;, + {b[,b'j 






(4.34; 



whose integrability condition is again satisfied for solutions of the Ernst equa- 
tions. 

We can perform an integral in the (r — z) -plane around the disk of the 
integrability condition of (4.32), namely 

(r-ie^^'a(J,^ -b = 0 , (4.35) 

which can be replaced by a surface integral. As we assume that the tangential 
derivatives of the metric are continuous at the disk, we obtain at the disk 



z=0+ — C- z\z=0- ■ 



(4.36) 



On the other hand reflection symmetry at the disk implies 

0 + C^,z\z—0~ 



(4.37) 
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on the disk, hence, 

®!zlz=o+ = o!,z\z=o~ = 0 ) (4.38) 

which by (4.32) implies b' = const on the disk. 

Now it is easy to calculate the second fundamental form kcd = (jcd,z 
of the disk z = 0 ( c,d,... = because we have at the disk that 

a'z = = 0, as a consequence of (4. 38), (4. 35) and (4.34). We find 

krr = —2U'^grr (4.39) 

4,^, = (4.40) 

kt>t> = 2U;,grt' (4.41) 

kt>^, = 2U[,gt>^'. (4.42) 

Now we can check the condition for a disk of dust [31], namely 

- ~kcd = 2+kcd = -87r(Tcd - = -8n{aUcUd + ]^(Jgcd) , (4.43) 

which, in the primed coordinates (because of = S^,), reads 

kc'd' = -87Ta{gc't’gd’t’ + gc'd') ■ (4.44) 

Because of the form of the metric (4.27) in primed coordinates and by (4.39- 
42), this is satisfied if we define the surface density by 

a = ^C/;, . (4.45) 

Thus we have shown that a rigidly rotating disk of dust is determined by 

a solution of the Ernst equation which satisfied at the disk U' = const and 
b' = const. Outside the disk the solutions of the Ernst equation must be 
regular. For a well-posed elliptic problem we need furthermore asymptotic 
flatness at infinity and regularity conditions at the axis. 

In Newton’s theory there is a 2-parameter family of disks (4.25), (4.26). If 
we use the 2-parameter group of similarity transformations - or dimensionless 
quantities - we can assume rp = 1 and cto = 1 £md we have just one disk. 

Because of the appearence of the velocity of light there is only a 1- 
parameter group of similarity transformations in Einstein’s theory. Hence, 
after we put ro = 1, we expect a 1-parameter family of disk solutions. 

The investigations of Neugebauer and Meinel suggest that 

fj. = (4.46) 



is an appropriate parameter. 

Neugebauer and Meinel prove by the so called inverse scattering method 
of soliton physics (compare the contribution of Maison in this volume) that 
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the boundary value problem for /' has a unique global solution provided Vq 
and U! are such that ^ < ficrit = 4.629 . . .. 

The solution f can be expressed in terms of hyperelliptic theta functions 
[60] . The remaining metric coefficients a' and k' are determined by integration 
from the equations (4.32) and (4.34). 

If we put the velocity of light, c, in the appropriate places we obtain the 
MacLaurin disk as a Newtonian limit. 

Further properties of these disks are discussed in [60]. 

Many global stationary solutions with disk sources may be constructed 
from known stationary vacuum solutions by “cutting out” a region containing 
singularities an making appropriate identifications. This method was first 
used by Bicak and Ledvinka [9] to produce physically plausible sources for 
the Kerr metric with arbitrary values of the parameters a, M. These disks 
are made of two streams of particles circulating in opposite directions with 
differential velocities. They are extending to infinity but have finite mass. 
See Sect. 6 of the article by Bicak in this volume, where this procedure is 
related back to the “method of images” in Newtonian galactic dynamics. In 
the static case these methods yield an infinite number of such static disk 
solutions. Solutions corresponding to stationary counterrotating dust disks 
of finite extent have been constructed by Klein and Richter [35]. 

5 Global Non-rotating Solutions 

5.1 Elastic Static Bodies 

No doubt, Einstein’s theory should allow for the description of static, solid 
bodies. It is useful to make the following distinction: 

(i) small bodies, whose shape is not dominated by gravitational forces, 
like a piece of sugar or an iron ball. If we ignore gravity, the structure of 
the body is determined by the laws of quantum mechanics. This is true in a 
Galilei invariant formulation as well as in a special relativistic one. Linear and 
non-linear elasticity theory describes the deformation of such a configuration 
under external forces. 

Suppose we now want to add the gravitational field. This is straightfor- 
ward for linear elasticity in Newtonian theory; we just have to insert the 
gravitational field calculated from the Poisson integral as an external force 
into the equations of elasticity. 

To pass from special relativity to Einstein’s theory is more complicated. 
Now the deformed configuration has to satisfy Einstein’s field equations, and 
the elasticity equations are a consequence of the latter! 

(ii) bodies like stars whose shape is dominated by gravitational forces. 
There a relaxed state does not really exist and one has to modify the de- 
sciption of elasticity. This holds in Newtonian theory as well as in Einstein’ 
theory. 
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Elastostatics can be described in Einstein’s theory as follows [17]. The col- 
lection of particles which form the body is described by the three-dimensional 
“body manifold” B. The essential dynamical variable is a map <P : — ?► B. 

such that is the world line of the particle in spacetime labeled by 

y® in B. In the static case the world lines of the particles are the integral 
curves of the Killing vector, and we can consider <P as a 1-1 map N ^ B. 
We assume that we have given on B a Riemannian metric Kij. Its physical 
interpretation may be different: for small bodies it describes a relaxed state; 
for big bodies which go never into a relaxed state, it could be an “isotropic 
state of minimal energy” . 

We need now information about the energy momentum tensor of the 
material. Let 

T'®®® = pu'^u'' + p'®®® , = 0 (5.1) 

be such that the stress tensor p'®®® has only spatial components and can be 
considered as a tensor on N. We can now define 



62 j . 



(5.2) 



the “Lagrangian strain tensor”. In the Hookian approximation of elastcity 
one assumes that one has given on the body B a tensor field such that 

after moving this object with <P into the space N' one can define 





(5.3) 




(5.4) 



as the energy and stresses of the body B in 3-space with the metric hij. 

With this energy momentum tensor we consider Einstein’s field equations 
as differential equations for the spacetime metric and the map No general 
existence theorem is available for this problem. The only case treated so far 
is the spherically symmetic one [62] . 

To get some feeling for these equation let us consider some further ide- 
alisation. For small deformations we can linearize Cij := ^{hij — as 

follows: Suppose that is a 1-parameter family of diffeomorphism N' — >■ N' 
such that e = 0 is the identity and <Pq is some diffeomorphism N' ^ B. Now 
we assume that defines our deformed body and calculate the stress 

tensor Cij to first order in e If we define = <Pot,Kij we obtain 

6b = \ihij - 

Here the vector field x® is defined by the linearization of on N' . We see 
that p®-^ ;j = 0 leads to second order differential equations for x“- 

Consider first the case of special relativity which coincides with Galilei 
invariant classical mechanics in the static situation. Then we have ha/s = rjaf} 
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and hij = 5ij. We choose to be the identity may which discribes the 
relaxed body in spacetime. With = 5ij we obtain 

(5.6) 

This gives implies the equations of classical, linearized elastostatics [51]. 

0 = = (5.7) 

With the appropriate symmetry and positivity conditions on the equa- 

tions are elliptic and solutions exist for various boundary conditions. 

Next we want to calculate the deformation of a small elastic body by 
its own gravitational field. The relaxed state is determined by solid state 
physics as above. To switch on gravity we assume that we have families 
9IJ.U = Vfiy + Ggl,i. + G'^gl^ ... and -b + G'^T^^ . . . satisfying 

the field equations. 

At order we obtain the trivial solution if there are no forces at the 
body , the density Pq is constant and the stresses vanish, i.e. = 0. The field 

equation in order G are obtained from the equations in section 2.4 with an 
energy momentum tensor which has only a term pg because the stresses 
vanish. We obtain as a solution of the Poisson equation with the source 
source p[j. The metric hX remains flat in this order. The expansion of the 
equation of motion in G gives to first order 

pib . = = -plu\a , AU^ = 47tGp° (5.8) 

Hence we obtain classical elastostatics with the force deforming the body 
being the gravitational force. 

One might try to obtain an existence theorem for small self gravitating 
elastic bodies in Einstein’s theory by an implicit function theorem argument 
similarly as in the case of a rigidly rotating body (section 4.2). 

5.2 Are Perfect Fluids 0(3)-Symmetric? 

It is intuitively “obvious” that a static, in particular non-rotating, ball of 
perfect fluid, due to the absence of shear stresses should have spherical sym- 
metry, and in particular the gravitational field in its exterior should be the 
one described by the Schwarzschild spacetime. This result, in its most general 
form, is still open in G.R. (The Newtonian case was settled in Lichtenstein [42] 
and Carleman [15], see also Lindblom [44].) Rather, one has today a theorem 
which is essentially a uniqueness result in the spirit of black hole uniqueness 
theorems. An earlier result due to Kiinzle and Savage [40] states that, near a 
spherical solution, there is no aspherical one with the same equation of state 
and the same mass. 

The following result, due to Beig and Simon [8], is a refinement of previous 
work by Masood-ul-Alam [52], see also the review of Lindblom [47]. 
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Theorem: Let us have a static, asymptotically flat, spherically symmetric 
solution to the Einstein equations with a perfect fluid and barotropic equation 
of state p = p{p). (This solution is called reference spherical solution.) Let 
there be given another static, asymptotically flat solution with the same 
equation of state and the same value E|as of the Killing vector norm on 
the surface dS of the star. Let further p{p) satisfy the differential inequality 
/ < 0, specified later. Then these two spacetimes are isometric, in particular 
the second one is also 0(3)-symmetric. 

The condition stipulating the existence of a spherical reference solution 
was disposed of by Lindblom and Masood-ul- Alam [48] . The condition on the 
matter, besides the one stating that p > 0, p > 0 and dp/ dp > 0, is that 

1 f] K 

I -.= + 2k+ {p + p)— <Q, (5.9) 

5 dp 

where k := /T. One can check that it is for example satisfied for the 

p+3p dp ^ 

equation of state of a relativistic ideal Fermi gas at zero temperature, but 
only up to densities of order 10^®gcm“^, which is roughly the critical density 
where gravitational instability sets in. It is known from numerical results [69] 
that beyond that limit the uniqueness statement of the above theorem will 
fail. One believes however, that sphericity will still hold. 

We will here confine ourselves to an outline of the proof to the case of the 
special equation of state given by [13] 

P{p) = \ {Po = const > 0) (5.10) 

which is a relativistic generalization of the equation for a polytrope of index 
5 in the Newtonian theory. The expression in (5.10) satisfies 1 = 0. The 
reference spherical solution in this case is known explicitly [7]. It has the 
curious property that it is asymptotically flat, but the fluid extends to spatial 
infinity. 

Introducing the variable v = (— K)^/^, the static field equations for a 
perfect fluid with energy momentum tensor 

Tfj.1, = {p + p)u^u^ + pg^i, (5-11) 

with read 

D^v = Attv{p + 3p) (5-12) 

M.ij = v~^DiDjV + 47t(p — p)hij (5.13) 

The asymptotic conditions (2.102,104) imply that u — ?► 1 at infinity. ^From 
the maximum principle for elliptic equations it follows that 0 < u < 1 in A^. 
Since the surface of the star is at infinity for the Buchdahl solution, the t;las, 
which is always equal to one in that case, has to be replaced by the total 
mass M (see Sect. 4.1). 
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Applying the contracted Bianchi identity to (5.12,13), there follows 

Dip= -v~^^{p + p)DiV. (5.14) 



Thus p and p are both functions of v and 

^ = -v~^{p + p). (5.15) 

Define the Cotton tensor of hij Bijk by 

= ‘2,D[k ■ (5.16) 

With the definition 

W={Div){D^v) (5.17) 

the equations (5.12,13) now imply (see Lindblom [46]) that 

D^W = + v~\D^v){D,W) + 8 ttv{D^v){D,p) 

+ ^W-^{D^W){DiW) - 8 t:W{p + p) + 167rV(p+ 3p)^ 
-AT:v{p+8p)W-^{D^v){DiW). (5.18) 



In the spherically symmetric case W = Wq has to be of the form Wq = Wq{v). 
The ODE resulting in that case from (5.18), has, for the equation of state 
(5.10), an explicit solution given by 



Wo 



(1-uY 



1 


T^PO 


71-uy 


16Af2 


3 


U+W _ 



(5.19) 



We assume that a = poM^ > 1. The function Wq is defined for v G [0, 1]. 
It is positive for v G {vc, 1), with Vc = (\/a — l)/{\/a + 1) and Wq(uc) = 0, 
Wo(l) = 0. Thus Wq satisfies the correct boundary condition at the central 
value Vc of v and at infinity. 

We now define, for the given solution (u, hij), the scalar function 



_ 2\“4 

VE-Wo=(^^j (W-Wo) 
and the conformally rescaled metric 




(5.20) 



(5.21) 



(The constant M occurring in Wq is taken to be the mass of the given so- 
lution.) In the asymptotically flat, vacuum case discussed in Sect. 3.2 one 



Time-Independent Gravitational Fields 



365 



finds that the metric hij extends smoothly to the manifold N = N U {/I}, 
with A the point at infinity. This is also true for the Buchdahl solution, and 
we assume it to be true for the given, a priori non-spherical one. After some 
calculations we find that 

D^{W - Wo) = + ^W~^D\W - Wo)D^{W - Wq). (5.22) 

Since W, Wq also extend smoothly to N, the function W — Wq satisfies the 
elliptic equation with nonnegative right-hand side on the compact manifold 
N. After integrating (5.22) over N (or by the maximum principle) it follows 
that Bijk is zero and 

W = Wo{v). (5.23) 

It then follows from [8], that the given model is isometric to the Buchdahl 
solution with the same po and the same M. 

5.3 Spherically Symmetric, Static Perfect Fluid Solutions 

The metric for a static spherically symmetric spacetime can be written 

ds^ = -f r'^{de'^ + sin^ 9 dcj}^) . (5.24) 

For a derivation see [28,70]. Here c is a constant which plays the role of the 
speed of light. In Appendix B of [28] it is also demonstrated that the in 
front of the sphere metric is no loss of generality for a static perfect fluid 
with positive mass density and pressure. Hence it is impossible to have two 
centers or two infinities. The field equations for a perfect fluid are 

SirGc-^pr^ = e-^{rX' -1) + 1 (5.25) 

S'kGc~‘^Pt‘^ = e~^{rv' -|- 1) — 1 (5.26) 

SttGc-'^P = (^ly" + + r~\iy' - A') - ^ly'X'^ (5.27) 

A prime denotes a derivative with respect to r. We have written —c^p for the 
timelike eigenvector of the energy-momentum tensor to make the compari- 
son with the Newtonian equations easier. The field equation imply ‘energy- 
momentum conservation’, which is a single equation for a static perfect fluid 

2p' = —iy'(p + (? p) ■ (5.28) 

The first exact solution of these equation was alredy found in 1918 by Karl 
Schwarzschild, the solution with constant density [37]. We have three in- 
dependent ordinary differential equations for for four functions. Hence, one 
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function can be specified freely. The most physical case is to prescribe an 
equation of state p = p{p). Equation (5.25) can easily be integrated: 

SttGI f . , , , , 

e =1 ^ — / r^p(r)dr + const . (5.29) 

r J 

As we only are interested in solutions with a regular center of spherical sym- 
metry we define A as follows 

e~^ = 1 — / r‘^p{r)dr . (5.30) 

r Jo 

The usual definition of the ’mass up to r\ namely 





m{r) = 47t / p{s)ds 

^0 


(5.31) 


gives 


, 2Gm(r) 


(5.32) 




r 




It is also useful to introduce the following quantity which is 


related to the 


’mean density up to r’ 
Then (5.32) becomes 


II 

1 

CO 


(5.33) 


A , 2G 2 
e = 1 ^ ■ 


(5.34) 



Various forms of the equations (5.25-28) will be used. Equations (5.25), (5.26) 
and (5.28) contain all the information. If we eliminate i/' then (5.26) and 
(5.28) imply the Tolman-Oppenheimer-Volkoff equation [75] 

p' = -Gr (l - J + p) (5-35) 

If an equation of state is given we can integrate (5.28) 

‘'‘’'’“"A <'’■*> 

In this formula po denotes the central pressure. If we add the definition of 
w then (5.35) and (5.33) form an integro-differential system. Differentiating 
(5.33) we obtain 

w' = - {Att p — 3w) (5.37) 

r 

In [65] the following theorem is proved. 
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Theorem: Let an equation of state p(p) be given such that p is defined for 
p > 0, non-negative and continuous for p > 0, C°° for p > 0 and suppose 
that dp/ dp > 0 for p > 0. 

Then there exists for any value of the central density po a unique inexten- 
sible, static, spherically symmetric solution of Einstein’s field equation with 
a perfect fluid source and equation of state p{p). The matter either has finite 
extent, in which case a unique Schwarzschild solution is joined on as an ex- 
terior field, or the matter occupies the whole space, with p tending to 0 as r 
tends to infinity. 

There are two parts of the proof. The equations (5.35) and (5.37) form a 
system of ordinary differential equations for p(r),w{r). However, the system 
is singular at r = and the first step is to demonstrate that for each value 
of the central density there is a unique solution such that the spacetime is 
regular at the center. This is shown in [65] or in [50]. This solution defines a 
neighborhood of a regular center and can be extended as long as (1 — ^r^w) 
remains positive. This can be seen as follows. 

Introduce the variables first used by Buchdahl [13] 

= 1 — ‘^r'^w , ( = e''!'^ , X = r'^ (5.38) 

Rewriting the equations in these variables and eliminating p in 5.26) and 
(5.28) gives an equation which is linear in and w, 



or 



2G 



G 



G 



(1 :pXw)(/^xx yC.x(tC + XW^a:),x ~ TTjW.xC = 0 

c c 2c 



(5.39) 



{yC,x) ,x 



G 

y 



(5.40) 



Let 0 < X < Xq be an intervall such that = (1 — ^xw > 0 and p > 0. As 
the density does not increase outwards we have w^x < 0. Therefore 



{yC,x),x < 0 

The equation (5.26) can be rewritten as 

C G 47t 

^From (5.41) and (5.42) we obtain the inequality 

w + Airp/c^ 



y > 



Wo + Airpo/c^ 



(5.41) 



(5.42) 



(5.43) 



Hence, we see that y cannot vanish before p. 

Suppose p{xb) = 0. Then we call the corresponding the radius of the 
star. The Schwarzschild solution is given in the form e~^ = e’’' = 1 — A/r for 
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some constant A. Hence, we determine a unique exterior field by the condition 
A = (^)m(rb). In this way the matter solution and the outside solution are 
joined only in a C'^-fashion because the boundary density may be non-zero. If 
we introduce Gauss coordinates relative to the hypersurface p = 0 the metric 
is C^. It is obvious that this metric cannot be extended because the area of 
the group orbits r = constant grows from 0 to infinity. 

Let us now consider the second possibility that p(x) > 0 for all x. Because 
p(x) is monotonically decreasing for x — >■ oo, lim2 ,_>oo = Poo exists. This 
implies that p' tends to 0 for x — >■ oo. Since y < 1, (5.35) then implies Poo = 0 
and hence, using the equation of state, that p — >■ 0 as x — >■ oo. As before the 
spacetime is not extensible. 

This completes our outline of the proof. It shows in particular that for for 
p(p) with p(0) = pb > 0 the radius of the star has to be finite. 

There are various exact global solutions known. (For a useful list of such 
solutions including a discussion of their physical acceptability has been given 
by Delgaty and Lake [19].) For the 1-parameter family of equations of state 
given by Equ. (5.10) the whole 1-parameter family of solutions is known. A 
2-parameter family of equations of state of interest for the issue of section 3.1 
is investigated by Simon [72]; all the corresponding exact solutions are given. 

There are some conditions on the equation of state known, which allow 
to decide whether the radius of the star is finite or infinite in the case of 
vanishing boundary density pb- In [65] it is shown that the radius of the star 
is finite if dp/p(p)^ is finite. Conversely, dp/(p(p)c~^p < oo implies 
that the matter distribution is infinitely extended. Both conditions depend 
only on the behaviour of the equation of state near the boundary p = 0. 
Makino [50] gives conditions for a finite radius in cases which are not covered 
by the above. He shows in particular, that for polytropic equations of state, 
p = const. p'^ with 4/3 < 7 < 2 the radius is finite. 

For finite distributions “Buchdahl’s inequality” holds [13]. 



Theorem: For finite distributions with non-negative density and a monotonic 
equations of state there holds 



2G M 1 
^ 9 



(5.44) 



Proof: To obtain the inequality one compares the solution with a solution of 
constant density p, an interior Schwarzschild solution. Equ. (5.40) implies for 
this solution (written with an overbar) that 

{yC,x),x = 0 = a = constant (5.45) 



We normalize C by the condition that at the boundary we have fb = yb- Then 
we find a if we rewrite (5.26) in the new varables 



SttG 

~^P = iy 

C‘‘ 
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(5.46) 
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and evaluate it at the boundary as a = ^w. 
Then (5.45) can be integrated with the result 



C(a;) = ^ 

Now (5.41) implies 

yC,x > {yQ,x)b = K,x (5.48) 

As y > y we obtain 

Cx > Cx = ^ (^1 + 2C(0) + (5.49) 

As <C is positive we obtain at the boundary 

yb>-^yb+^ (5.50) 



which is (5.44). 

Buchdahl’s inequality show that one can pack only a certain mass into a 
given fixed radius. The physical reason is that the pressure is also a source 
of the gravitational field. In Newton’s theory there are constant density balls 
with a fixed radius for arbitrary density. In Einstein’s theory the central 
pressure diverges if the density approaches some maximum value. 

In [1] an analogue of Buchdahls inequality is derived for distributions in 
which the the density is only assumed to be positive. There holds 1 — ^ > 

0 . 

Another important topic are bounds on the total mass of the system. 
Suppose we know the equation of state only for p < pQ. Then we can estimate 
the mass and radius of a core in which the density is greater po as follows: 
Clearly, m{ro) > ^po{ro)^; because of j/ > 0 we have also m(ro) < 

Hence the possible cores occupy a compact part of the m(ro)-rQ- plane. Taking 
intial values from this part one can numerically integrate outwards using the 
known equation of state, until the pressure vanishes. This was done in [27] for 
Po = 5.1 X lO^^g/cm^ and with a certain realistic equation of state for smaller 
densities. All configurations had a total mass smaller then 5Mq. It is quite 
remarkable the the knowledge of the equation of state for a finite density 
range allows to show such a bound on the total mass, assuming nothing but 
the monotonicity of the equation of state in the unknown density range. This 
is not possible in Newton’s theory. 

In the special case of bodies with a sharp edge, i.e pb > 0, we can combine 
the Buchdahl inequality (5.44) with the estimate M = m{rb) > 47rp;,r^ to 
obtain the mass bound 



M < 



/2\V 3c® 

\3y \4:TTG^Pb 



1/2 



(5.51) 
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Let us finally compare with Newton’s theory. In (5.35) it is almost obvious 
that for c —>■ oo one obtains the Newtonian equation for the pressure. The 
relativistic corrections show how ’’the pressure enters in the active and passive 
gravitational mass” . The first factor describes an effect of the geometry. Static 
fluid ball are the simplest examples of families of relativistic solutions with a 
Newtonian limit [21]. 



5.4 Spherically Symmetric, Static Einstein— Vlasov Solutions 

In recent years existence and further properties of solutions of Einstein’s 
field equations for a collisionless gas have been shown [64]. The Vlasov- 
Einstein system determines the spacetime metric and the distribution func- 
tion describing the particles. 






= (5.52) 

= STrT'^'". 

In the static spherically symmetric case and for the metric (5.24), these equa- 
tions reduce to (r= are the spatial frame components of p°‘) 



// d^if - y/l + v'^v'—dyif = 0 (5.53) 

VI -I- r 

87rGc"Vr2 = e"^(rA' - 1) -b 1 (5.54) 

S'kGc~‘^Pt‘^ = e~^{rv' -b 1) — 1 (5.55) 

where 

p{x) = p{r) = f /(a;*, n*)-\/l -b v^dv , (5.56) 

p{x)=p{r) = f f(x\v^) 

Jr 3 \ f J Vl + v^ 



The distribution function is assumed to be spherically symmetric. 

Rein and Rendall [64] show the existence of asymptotically flat solutions, 
regular at the center, with finite total mass and finite extension of the mat- 
ter and isotropic pressure. It is also possible to construct solutions with 
anisotropic pressure; Furthermore shells of finite extent of matter around 
a regular center or a black hole can be constructed [63]. 
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Abstract. The theory of gravitational leasing is discussed in a Lorentzian manifold 
setting. To that end we fix a point p (observer at a particular instant) and a time- 
like curve 7 (worldline of a light source) in a 4-dimensional Lorentzian manifold 
(spacetime) and we investigate how many past-pointing lightlike geodesics (light 
rays) go from p to 7 . If there is more than one such geodesic, then we are in a 
gravitational lensing situation. Among other things, we study the geometry of light 
cones and we use the theory of conjugate points and cut points to find necessary 
and sufficient criteria for gravitational lensing; we discuss a Morse theory, based on 
a general relativistic version of Fermat’s principle, to characterize the number of 
images for gravitational lensing situations in globally hyperbolic spacetimes; and 
we discuss gravitational lensing in asymptotically simple and empty spacetimes, 
giving an elementary proof for an odd number theorem in this situation. 



1 Introduction 

According to general relativity the path of a light ray is influenced by the 
gravitational held of massive objects. The verification of this effect during 
a total Sun eclipse in the year 1919 made Einstein’s theory famous all over 
the world. It was soon realized by Eddington [12] and Chwolson [10] that, in 
principle, this deflection of light by massive objects might lead to the effect 
that an observer sees two or more distinct images of one and the same light 
source. Also, Chwolson [10] mentioned the possibility that, in cases of axial 
symmetry, the light source might appear as a ring around the deflecting mass. 
Those effects are usually summarized under the name gravitational lensing. 
For many decades it was not clear if gravitational lensing is, indeed, realized 
in Nature. It was not before 1979 that a promising candidate for gravitational 
lensing was found. In this year Walsh, Carlswell and Weyman [81] published 
their results on the double quasar 0957 -1-561 and suggested that in this case 
we see two images of one and the same quasar, produced by the gravita- 
tional held of an intervening galaxy. Since then, a great number of further 
gravitational lens candidates have been found, including multiple quasars, 
radio rings and luminous arcs. This has led to the effect that gravitational 
lensing is one of the most rapidly developing held in astronomy, in particular 
from an observational but also from a theoretical point of view. There is a 
comprehensive monograph on the subject by Schneider, Ehlers and Falco [70] 
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and there is a great number of review articles, including a regularly updated 
electronic review by Wambsganss [82] from which literature, in particular on 
the present status of observations, can be traced. 

In this article we want to approach the theory of gravitational lensing 
from the viewpoint of Lorentzian geometry. This is somewhat unusual inso- 
far as the majority of theoretical work on gravitational lensing is done in a 
quasi-Newtonian approximation formalism which was developed, in essence, 
by Sjur Refsdal in the 1960s and which is discussed in full detail, e. g., in 
Schneider, Ehlers and Falco [70]. In the standard version of this approxi- 
mation formalism one restricts to a purely spatial description, as opposed 
to a spacetime description, and light rays are represented by straight lines 
in Euclidean 3-space, with the only exception that they may have a sharp 
bend when passing through a particular plane which is known as the “de- 
flector plane”. (There is also a variant with several deflector planes.) This 
formalism has proven very powerful for calculating particular models. On 
the other hand, one should keep in mind that it is only an approximation. 
Thus, it is perfectly fine if it is used for quantitative calculations where the 
approximative assumptions are satisfied, but it is not the complete story as 
far as qualitative aspects of the theory are concerned. Gravitational lensing, 
by its very nature, is a general relativistic effect and it can be understood 
only on the basis of a 4-dimensional spacetime description, i. e., in terms of 
Lorentzian geometry. 

Therefore, the following strategy seems to be appropriate for studying 
the theory of gravitational lensing. In the beginning one should concentrate 
on getting an understanding of gravitational lensing in terms of spacetime 
diagrams and becoming familiar with the 4-dimensional geometry involved. 
The present article tries to serve this purpose. After that, one should study 
the passage to the quasi-Newtonian formalism which involves several ap- 
proximative assumptions. Some of these assumptions are easily understood, 
such as that the gravitational held should be weak and that the deflection 
angles should be small. However, in addition one needs some assumptions 
whose interpretation is less obvious. So the passage to the quasi-Newtonian 
approximation is, in fact, a rather subtle issue. These problems are carefully 
discussed by Seitz, Schneider and Ehlers [73], related material can also be 
found in Schneider, Ehlers and Falco [70] and in Sasaki [68]. In the third step 
one is then ready to study how the quasi-Newtonian formalism is operating. 
This is what is done in the majority of the theoretical literature on gravita- 
tional lensing and what is reviewed in full detail, e. g., in Schneider, Ehlers 
and Falco [70]. For mathematical aspects of gravitational lensing in the quasi- 
Newtonian approximation formalism, in particular for the theory of caustics, 
we also refer to a forthcoming book by Fetters, Levine and Wambsganss [65]. 

For our plan to study gravitational lensing in a Lorentzian geometry set- 
ting we assume that light propagation can be described in terms of rays and 
we restrict to light rays in vacuo, i. e., we exclude the case that the light rays 
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are influenced by a medium, e. g., in terms of diffraction, on their way from 
the light source to the observer. Moreover, we shall restrict to the case that 
both the observer and the light source may be considered as pointlike. We are 
then naturally led to studying past-pointing lightlike geodesics (light rays) 
from a point (observer at a particular instant) to a timelike curve (worldline 
of the light source) in a Lorentzian manifold (spacetime). If there are two 
or more such geodesics, then we are in a gravitational lensing situation. In 
essence, our analysis will be kinematical throughout, although Einstein’s held 
equation will be mentioned occasionally. 

The article is organized as follows. In Sect. 2 we recall some basic notions 
from Lorentzian geometry and fix some conventions as to terminology and 
notation. These conventions will be essential for understanding the following, 
so the reader is kindly requested to read this section carefully. In Sect. 3 
we study gravitational lensing situations in arbitrary spacetimes. In Sect. 4 
we specialize to the case of globally hyperbolic spacetimes and in Sect. 5 we 
further specialize to asymptotically simple and empty spacetimes. 

Many mathematical results will be simple corollaries of standard theorems 
from Lorentzian geometry which can be found in the books by Hawking 
and Ellis [29], Wald [80], O’Neill [50], or Beem, Ehrlich and Easley [4]. For 
theorems proven in one of those books the proof is not repeated here unless 
in cases where this seemed instructive. Also, the material on Morse theory, i. 
e.. Sects. 3.5 and 4.2, turned out to be so technical that for proves the reader 
must be refered to the quoted original papers. In all other cases proves are 
given in (hopefully) sufficient detail. 



2 Some Basic Notions of Spacetime Geometry 

According to general relativity, a spacetime is a 4-dimensional Lorentzian 
manifold. For convenience we shall consider only Lorentzian manifolds that 
are time-orientable, i. e., we assume that it is possible to distinguish between 
future and past in a globally consistent way. More precisely, we use the fol- 
lowing definition. 

Definition 1. A spacetime is a triple {A4, g,T^) where 

(a) A4 is a connected 4-dimensional real C°° manifold whose topology satis- 
fies the axiom of second countability and the Hausdorff axiom and is, thus, 
paracompact; 

(b) g is a Lorentzian metric on M, i. e., a symmetric covariant second rank 
C°° tensor field which has signature (-I-, -I-, -I-, — ) at each point; 

(c) T’*' is a time orientation for (A4,g), i. e., the set of timelike tangent vectors 
{ X € Tj\4 I g(X, X) < 0 } consists of exactly two connected components and 

is one of those components. 

For a spacetime (A4, g,'T'^), we denote the Levi-Civita connection of 5 by V 
and we denote the Riemannian curvature tensor of V by R. 
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A linear subspace N of the tangent space TpAi is called (i) spacelike if g 
is positive definite on TV, (ii) lightlike if g is positive semi-definite but not 
positive definite on TV, and (iii) timelike otherwise. The property of being 
spacelike, lightlike or timelike is assigned to a vector in TpM. if the linear 
space generated by this vector has the respective property, to a differentiable 
curve if its tangent vector has the respective property everywhere and to a 
submanifold of Ai if its tangent space has the respective property everywhere. 
Moreover, a differentiable curve is called causal if its tangent vector is either 
timelike or lightlike at each point. 

Here and in the following, by a curve in Ai we mean a map from a real 
interval I into Ai. (A “real interval” is a connected subset of R that contains 
more than one point. / may be open, half-open or closed.) Quite generally, 
we shall use a lower case greek letter for a curve, e. g., 7 : / — >■ Ai, and 
we shall use the corresponding boldface letter for the image set of this curve, 

e- g-, 7 = {7(5) I s e / }. 

By a geodesic we always mean what is more fully called an “affinely para- 
metrized geodesic”, i. e., a C°° curve A : I — >■ Ai such that Va'A' = 0. This 
leaves the freedom of changing the parameter affinely, I — > I , s 1 — > as + b 
with a ,6 G R, a yf 0. However, when counting geodesics we shall tacitly 
identify two geodesics if one is a reparametrization of the other. That is to 
say, in a sentence such as “There are two geodesics Ai and A 2 ...” it goes 
without saying that A 2 is not just a reparametrization of Ai. Please note 
that, according to this rule, a periodic geodesic gives rise to infinitely many 
geodesics between any two points on this geodesic. 

Our study will be concentrating upon lightlike geodesics, which are to 
be interpreted as light rays. It is well-known and easily verified that under 
a conformal transformation g 1 — > g of the metric g, with an arbitrary 

function / : Ai — > R, the lightlike geodesics undergo a reparametriza- 
tion but are unchanged otherwise. Since we are interested only in the paths 
of lightlike geodesics and not in their particular parametrizations, we could 
therefore allow for arbitrary conformal transformations of the metric, i. e., 
we could prescribe a conformal equivalence class rather than a particular 
metric. However, we shall not do so because we want to occasionally discuss 
additional assumptions on the spacetime which are not conformally invariant, 
such as, e. g., conditions on the Ricci tensor. 

The totality of all geodesics issuing from a point p give rise to the expo- 
nential map 

expp : Wp — > Ai . (1) 

This map is defined on a subset Wp of the tangent space TpAi by setting 
expp(A') = A(l) where A : [0,1] — >■ Ai is the geodesic with A'(0) = X. 
It is well known that the maximal domain Wp on which this map is well- 
defined is an open subset of TpAi that contains the origin. In general, Wp 
is not all of TpAi, thereby reflecting the fact that a geodesic may arrive at 
the “boundary” of Ai (using the word “boundary” in a colloquial manner) 
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before its affine parameter has reached the value 1. In general, the map exp^ 
need not be injective on its maximal domain Wp, thereby indicating that 
the geodesics isssuing from p may reconverge and, eventually, intersect each 
other. However, it is well known that there is an open neighborhood W° of 
the origin in TpA4 such that the restriction of exp^ to W° is a diffeomorphism 
onto its image. Thus, sufficiently short pieces of geodesics issuing from p do 
not intersect. A neighborhood of p that is contained in the image of W° under 
expp is called a normal neighborhood. 

This notion can be used to assign the property of being timelike or causal 
to continuous curves which need not be differentiable. A curve 7 : I — > M. 
is called timelike (or causal, respectively) if it is continuous and if each s G I 
has a neighborhood / in / such that for any two parameter values si and 
S 2 in I there is a timelike (or causal, respectively) geodesic from 7 (si) to 
7 ( 52 ) which is completely contained in a normal neighborhood of 7 ( 5 ). If 
this geodesic is future-pointing whenever si < S 2 , 7 is called future-pointing; 
otherwise 7 is called past-pointing. 

Moreover, we shall frequently use the following standard definition. 

Definition 2. For a point p in a spacetime (M,g, T^), we define the chrono- 
logical future I'^{p) (and the chronological past I~{p), respectively) of p as 
the set of all points q G Ai that can be reached from p along a future-pointing 
(or past-pointing, respectively) timelike curve. 

I+(p) and I~{p) are obviously open subsets of M for every p G M. However, 
as long as the causal structure of spacetime has not been restricted this is 
more or less the only statement that can be made about these two sets. The 
following causality notions will be of relevance for us. 

Definition 3. (a) A spacetime {Ai,g,T'^) is called causal at a point p G Ai 
if there is no closed causal curve through p. 

(b) A spacetime {Ai, g,T'^) is called future- distinguishing (or past-distin- 
guishing, respectively) at p if every neighborhood U of p contains a neigh- 
borhood V of p such that a future-pointing (or past-pointing, respectively) 
causal curve from p that has left U cannot reenter V. An equivalent condi- 
tion is that the equation I~^{p) = A'^{q) (or the equation I~(p) = X~{q), 
respectively) implies the equation p = q. 

(c) A spacetime {Ai, g,T~^) is called strongly causal at p if every neigh- 
borhood U of p contains a neighborhood V of p such that no causal curve 
intersects V more than once. 

It is easy to check that the strong causality condition implies both the future- 
distinguishing and the past-distinguishing condition and that either distin- 
guishing condition implies the causality condition. For a rather detailed dis- 
cussion of these well known notions we refer to Hawking and Ellis [29], to 
O’Neill [50] and to Beem, Ehrlich and Easley [4]. In particular, illustrative 
examples of spacetimes satisfying some causality assumptions but violating 
others are given in Figs. 37 and 38 of Hawking and Ellis [29]. 
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3 Gravitational Leasing in Arbitrary Spacetimes 

In this section we discuss the geometry of gravitational lensing situations 
in arbitrary spacetimes {M, g,'T^). To that eird we fix a point p G M aird 
a timelike C°° curve 7 : I — > A 4 . We interpret p as an event where an 
observation takes place (i. e. “an astronomer here and now in his observa- 
tory”) and we interpret 7 as the worldline of a light source, e. g., a distant 
quasar. This interpretation is, of course, based on the assumption that the 
spatial extension of the light source and of the observer can be neglected, 
i. e., that they can be considered as pointlike. Our assumption of 7 being 
timelike means that the light source moves at a sublriminal velocity. There 
is iro need to specify the parametrizatioir of 7 (e. g., to proper time para- 
metrization glpf' ,"f') = —1) siirce we are primarily interested iir the set 7 aird 
irot iir a particular parametrization. - The questioir we want to discuss is the 
followiirg (see Figs. 1 and 2). 

How many past-pointing lightlike geodesics are there that start at 

the point p aird terminate on the worldline 7 ? 




Fig. 1. This is a typical spacetime diagram of a multiple imaging situation. In 
correspondence with the three past-pointing lightlike geodesics from p to 7, an 
observer at p would see three images of a light source with worldline 7, as indicated 
in the insert. Actually, one of the three images is hidden behind the deflector if the 
latter is non-transparent. 



According to the rules of general relativity, every lightlike geodesic can be 
interpreted as a light ray traveling under the influence of gravity alone. (As 
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outlined in the Introduction, we shall not be concerned with light rays influ- 
enced by a medium throughout this text.) Thus, each past-pointing lightlike 
geodesic from p to 7 gives rise to an image of the light source 7 at the ce- 
lestial sphere of the observer p. To justify this interpretation one may view 
each lightlike geodesic as representing a thin bundle of almost parallel light 
rays which is focused by the observer’s eye lense onto his or her retina. - The 
following cases are to be distinguished. 

Case A: There is no past-pointing lightlike geodesic from p to 7. Then the 
observer at p does not see any image of the light source 7. Situations 
of this kind are far from being unusual. They may occur even for an 
inextendible worldline 7 in Minkowski space, viz., if 7 asymptotically 
approaches the past light cone of p. Please note that, in general, the non- 
existence of a past-pointing lightlike geodesic from p to 7 does not imply 
the non-existence of a past-pointing causal curve from p to 7. In other 
words, even if p cannot receive a freely traveling light ray from 7 it is very 
well possible that p can be causally influenced by 7. The Godel cosmos 
provides an interesting example of this kind, see, e. g. Hawking and Ellis 
[29], Sect. 5.7. In this spacetime any two points can be joined by a past- 
pointing causal curve; however, the lightlike geodesics issuing from some 
point are restricted to a cylindrical region. 

Case B: There is exactly one past-pointing lightlike geodesic from p to 7. 
Then the observer at p sees exactly one image of the light source 7. This 
is the situation naively taken for granted in pre-relativistic astronomy. 
Case C: There are at least two but not more than denumerably many past- 
pointing lightlike geodesics from p to 7. Then the observer at p sees finitely 
or infinitely many distinct images of 7 at his or her celestial sphere. In view 
of Einstein’s held equation one may think of a heavy mass ( “deflector” ) , 
occupying a worldtube between 7 and p, whose gravitational held causes a 
bending of light rays. Figure 1 shows a typical three-image-conflguration. 
Please note that, generically, different lightlike geodesics from p to 7 in- 
tersect the worldline 7 at different points. In other words, the various 
images seen at p show the light source at different ages. Astronomers use 
the term time delay for this phenomenon. 

Case D: There are more than denumerably many past-pointing lightlike geo- 
desics from p to 7. E. g., there may be a continuous one-parameter family 
of lightlike geodesics from p to 7 such that the light source 7 appears at 
the celestial sphere of the observer p as an arc or, in situations of axially 
symmetry, as a ring, see Fig. 2. Such rings are often called Einstein rings 
although Chwolson [10] and not Einstein was the first to mention this 
phenomenon. We shall prove later in Proposition 12 that all members of 
a continuous one-parameter family of light rays from p to 7 necessarily 
meet 7 at the same point. In other words, all parts of such an extended 
image show the light source at the same age. 
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Fig. 2. In the situation depicted in this spacetime diagram there is a one-parameter 
family of past-pointing lightlike geodesics from p to 7 . Correspondingly, an observer 
at p would see an extended image, such as an arc, of a light source with worldline 
7 . The insert indicates what is seen by the observer at p. 



Whenever Case C or Case D occurs astronomers speak of multiple imag- 
ing by the gravitational lens effect. In Proposition 10 below we shall prove 
that Case D is “exceptional” in the sense that, under a small perturbation 
of the point p (keeping the worldline 7 fixed), Case D always disappears. 
In this sense, Case C is the “generic” multiple imaging case. However, this 
does not mean that Case D is of no interest from a physical point of view. 
First, a systematic investigation of Case D situations in a spacetime will give 
interesting information on Case C situations. Second, the “non-genericity” 
of Case D situations is based on our idealization to view the light source 
and the observer as spatially non-extended, i. e., as pointlike. For extended 
light sources it is not true that Case D situations can be destroyed by an 
arbitrarily small perturbation. That is the reason why, actually, astronomers 
do observe arcs and rings produced by gravitational lensing. 

It is now our goal to characterize spacetime geometries that lead to mul- 
tiple imaging, with special emphasis on the difference between Case C and 
Case D situations. This will be done in the next three subsections. We begin 
with a review of the notions of conjugate points and of cut points in Sect. 3.1. 
These notions will be instrumental to study the local and global geometry of 
light cones in Sect. 3.2 and, thereupon, to give criteria for multiple imaging in 
Sect. 3.3. Sections 3.4 and 3.5 are devoted to a version of Fermat’s principle 
on arbitrary spacetimes that is very useful in studying gravitational lensing 
situations. 
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3.1 Conjugate Points and Cut Points 

The notion of conjugate points is used to characterize the situation that 
neighboring geodesics undergo a (partial) focusing effect. We are interested 
in lightlike geodesics in a spacetime {Ai,g, T~^) that pass through a particular 
point p. (In applications to gravitational lensing this will be the point where 
the observation takes place and we shall be interested in lightlike geodesics 
that issue from p into the past.) In other words, we are interested in C°° 
curves A : I — A4 that satisfy the conditions 



where Sq denotes a fixed parameter value Sq G I. 

To investigate the behavior of geodesics which are close to A we consider 
a one-parameter variation of A, i. e., a C°° map g :]—eo,So[x I — > M with 
ry(0, •) = A where £o is some positive real number. We assume that not only 
A but also all the varied curves r]{£,-), for 0 < |er| < |eo|, satisfy the three 
conditions (2), (3), (4). By differentiation with respect to the variational 
parameter e this implies that the variational vector field J : / — > TA4, 
which is defined by J(s) = r]{-, s)^(O), satisfies the three conditions 



see Fig. 3. For any vector field J along A that satisfies these three condi- 
tions, one may think of the “arrow-head” of J as tracing a neighboring light- 
like geodesic through p in linear approximation. (5) is called the equation of 
geodesic deviation or the Jacobi equation and any solution J of this equation 
is called a Jacobi field along A. 

The equations (5), (6) and (7) are obviously satisfied by any multiple of 
the tangent field, J{s) = f{s) A'(s), with f{so) = 0. Such a solution of (5), 
(6) and (7) is called trivial since it represents an infinitesimally neighboring 
geodesic which is just a reparametrization of A. We are now ready to define 
the notion of conjugate points. For si £ I \ {so}, one says that the point 
q = A(si) is conjugate to p = A(so) along A if there is a non-trivial solution J 
of (5), (6) and (7) such that J(si) is parallel to A'(si). It is easy to check that 
the conjugacy of q to p along A is independent of which affine parametrization 
has been chosen for A. Also, it is known that in a compact section of a lightlike 
geodesic there are at most finitely many points conjugate to a given point 
p, see, e. g., Beem, Ehrlich and Easley [4], Theorem 10.77. (The same result 



Va'A' = 0, 
5(A',A') = 0, 

A(so) = P, 



( 2 ) 

( 3 ) 

( 4 ) 



Va'Va'J- 7?(A', J,A') =0, 
p(A',VvJ) =0, 

J{So) = 0 , 



( 5 ) 

( 6 ) 
( 7 ) 
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Fig. 3. The solutions J of (5), (6) and (7) form an “infinitesimal bundle of light 
rays” around A. The timelike dimension is suppressed in the picture. 



is true for timelike geodesics as well, but not for spacelike ones. An example 
where a whole interval is conjugate to a point along a spacelike geodesic 
was contrived by Heifer [30]. The reader is cautioned against a proof that 
conjugate points are always isolated, along any geodesic in a semi-Riemannian 
manifold of any signature, suggested by O’Neill [50], Exercise 8, p. 299. This 
is one of the very few mistakes in this otherwise excellent text-book; a basis 
of Jacobi fields with the desired properties need not exist.) 

It follows directly from the definitions that a conjugate point indicates a 
partial focusing effect in the following sense. If, for a lightlike geodesic A, the 
point A(si) is conjugate to the point A(so), then a one-parameter family of 
lightlike geodesics issuing from A(so) is being refocused into the point A(si) 
to within linear approximation. 

In addition, conjugate points indicate that a geodesic loses its extremizing 
property, according to the following well-known proposition. 

Proposition 1. Let A : I — > M be a lightlike geodesic in a spacetime 
{M, g,'T'^) and consider two different parameter values So and si in I. If, 
for some s €]so,si[, the point A(s) is conjugate to \{so) along A, then there 
is a C°° variation o/A|[s^_sj] such that all varied curves are timelike curves 
from A(so) to A(si). Conversely, the existence of such a variation implies 
that there must be a point A(s) conjugate to A(so) in the half-open interval 
s e]so,si]. 

For a proof we refer to Hawking and Ellis [29], Proposition 4.5.11 and Propo- 
sition 4.5.12. It might be helpful to consult O’Neill [50], Chap. 10, Proposition 
48, in addition. 

Proposition 1 says that an observer moving at subluminal velocity may 
catch up with a light ray A after the latter has passed through a conjugate 
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point and that, for observers staying close to A, this is impossible otherwise. 
Here the restriction to observers staying close to A is essential. Observers 
“taking a short cut” may very well catch up with a lightlike geodesic even if 
the latter is free of conjugate points. For investigating the extremizing prop- 
erty of a lightlike geodesic from a global point of view, not just with respect 
to neighboring curves, the notion of conjugate points is not appropriate. In- 
stead, one has to consider the notion of “cut points” which was introduced in 
Riemannian geometry (i. e., for positive definite metrics) by Poincare [66] for 
a special situation and by Whitehead [85] in generality. For lightlike geodesics 
in a Lorentzian manifold, this notion can be introduced in the following way, 
cf. Beem, Ehrlich and Easley [4]. 

For any two points p and g in A4 , we denote by d{p, q) the Lorentzian 
pseudo-distance between p and q, i. e. 

d{p, q) = supr [ J \g{/3'{s), P'{s))\ ds , (8) 

13 Jo ^ 

where the supremum is to be taken over all causal curves (3 : [0, 1] — with 
/3(0) = p and /3(1) = q. Owing to the well-known fact (cf. Beem, Ehrlich and 
Easley [4], p. 75) that any causal curve is differentiable almost everywhere, it 
is not necessary to restrict to differentiable causal curves to make sure that 
the integral in (8) does exist. Whenever p and q are causally related, such 
that the supremum is to be taken over a non-empty set, the existence of the 
supremum is guaranteed but it may be infinite. 

Now let A : / — ^ be a past-pointing lightlike geodesic. For So and 

Si in I with So < si, A(si) is called the past cut point of A(so) along A if 
d(A(so), A(s)) = 0 for s G ]so,si] and d(A(so), A(s)) > 0 for all s G / with 
s > si. Thus, the past cut point occurs where A loses its extremizing property 
among causal curves with respect to the pseudo-distance. In this situation, 
for s > Si the point A(s) can be reached from A(so) along a past-pointing 
causal curve which is not a lightlike geodesic. By a well-known theorem (see, 
e. g.. Hawking and Ellis [29], Proposition 4.5.10) this implies that A(s) can 
be reached from A(so) along a past-pointing timelike curve. Thus, beyond the 
past cut point A intersects some past-pointing timelike curve which started 
together with A at A(so). 

It is not difficult to check that the past cut point of p along A is inde- 
pendent of which past-pointing affine parametrization has been chosen for A. 
Also, it follows directly from the definition that the past cut point is unique 
if it exists. The non-existence of the past cut point may have two quite dif- 
ferent reasons. Either d(A(so), A(s)) = 0 for all s G / with s > So, i. e., the 
extremizing property is always preserved; or d(A(so), A(s)) > 0 for all s G / 
with s > So, i. e., the extremizing property never holds. Clearly, the latter 
case is possible only if the past distinguishing condition of Definition 3 (b) is 
violated. In other words, the past-distinguishing property guarantees that a 
sufficiently short past-pointing lightlike geodesic is extremizing. 
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There is, of course, a completely analogous definition of the future cut 
point along a lightlike geodesic. However, we shall concentrate upon the past 
cut point because this will be the relevant notion in view of gravitational 
leasing situations. 

In Riemannian geometry the notions of cut points and conjugate points 
are related by the easily remembered rule: “The cut point comes first”, see, 
e. g., Klingenberg [35], Proposition 2.1.7. The proof of this result is based 
on the well-known fact that a sufficiently short Riemannian geodesic always 
extremizes the Riemannian distance. We have just seen that for lightlike 
geodesics in Lorentzian manifolds the analogous fact need not be true unless 
the distinguishing property is satisfied. Therefore, the rule “The cut point 
comes first” can be proven for distinguishing spacetimes only. The precise 
statement reads as follows. 

Proposition 2. Let A : I — > Ai he a past-pointing lightlike geodesic in a 
spacetime that satisfies the past-distinguishing property at the point p = A(so) . 
Assume that, for some parameter value Si > Sg, the point A(si) is conjugate 
to A(sg) along A. Then the past cut point A(s) o/ A(sg) along A exists and it 
is So < s < si- 

Proof. Since the past-distinguishing property is satisfied at p, the equation 
d[p,X{s)) = 0 holds for s € [so,Sq -k e[ whenever e is a sufficiently small 
positive number. By Proposition 1, d{p, A(s)) > 0 for s € ] si , si -I- 5 [ for 
arbitrarily small positive 6. Thus, the past cut point must lie in the parameter 
interval jsg, si] • ^ 

We end this subsection with a proposition saying that, under the past- 
distinguishing assumption, any intersection of two past-pointing lightlike 
geodesics starting from a point p is indicated by the occurence of a past 
cut point on each of those geodesics, please cf. Beem, Ehrlich and Easley [4], 
Lemma 9.13. 

Proposition 3. Let {M, g,T^) be a spacetime that satisfies the past-distin- 
guishing condition at a point p £ AA. Assume that there is a point q £ M 
that can be reached from p along two past-pointing lightlike geodesics. Then 
the past cut point of p exists on each of those geodesics {and it comes on or 
before q ) . 

Proof. We may parametrize the two past-pointing lightlike geodesics such 
that Ai(0) = A2(0) = p and Ai(l) = A 2 (l) = q. Then A'^(l) and A 2 (l) are 
linearly independent since otherwise A 2 would be a reparametrization of Ai. 
This follows from the uniqueness theorem for the geodesic equation and from 
the fact that, owing to our past-distinguishing condition, a closed lightlike 
geodesic through p cannot exist. Then, for small positive e, the curve Ai|[op] 
joined to the curve A 2 |[iq+£] gives a causal curve which is not an (unbroken) 
lightlike geodesic. By a well-known theorem (see Hawking and Ellis [29], 
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Proposition 4.5.10) this implies that the point A 2 (l + e) can be reached from 
p = Ai(0) by a timelike curve, thus d(p, A 2 (l + e)) > 0. On the other hand, 
the past-distinguishing condition at p guarantees that d[p,X 2 {so -I- <5)) = 0 
for small positive S. Hence, the past cut point of p along A 2 comes on or 
before q = A 2 (l). □ 

Moreover, one might ask if the past cut point itself can be reached from 
p along a second past-pointing lightlike geodesic. A theorem to that effect 
can be proven only under the assumption of global hyperbolicity and will be 
postponed until Sect. 4, see Proposition 14 below. In the Riemannian case, an 
analogous result holds on complete Riemannian manifolds and is the content 
of the celebrated Poincare Theorem, proven by Poincare [66] for a special 
case and by Whitehead [85] in its generality. It is this property that gave rise 
to the name “cut point” . 

3.2 The Geometry of Light Cones 

In a spacetime, the lightlike geodesics issuing from a point p into the past 
make up the socalled past light cone of p. In general, the past light cone need 
not be an immersed (let alone embedded) submanifold of A4, i. e., even its 
local structure may be very complicated. The failure of past light cones to be 
submanifolds is crucial for gravitational lensing. In this subsection we use the 
notions of conjugate points and cut points to investigate whether the past 
light cone is an immersed or embedded submanifold. 

Please recall that the totality of all geodesics issuing from a point p are 
given in terms of the exponential map (1). For our study of gravitational 
lensing we are interested in lightlike geodesics issuing from p into the past. 
Therefore, we restrict the exponential map to the 3-dimensional submanifold 

C~ = {a G yVp \ {0} I X is lightlike and past-pointing } (9) 

of TpA4 ■ In (9) Wp C TpAi denotes the maximal domain of exp^ . For the 
sake of convenience, we introduce the abbreviation 

e~ = expp|g_ :C~ — >M (10) 

for the restriction of the exponential map to C~ . The image of this map e~ is 
the past light cone of p, i. e., the set of all events q € M that can be reached 
from p along a past-pointing lightlike geodesic. This set determines what is 
visible for an observer at p. In particular, it determines whether p observes a 
gravitational lensing situation. 

e~ is a C°° map from a 3-dimensional manifold into a 4-dimensional 
manifold. Thus, at any X G C~ the rank of the differential Txe~ cannot 
be bigger than 3. If the rank is equal to 3, e~ is an immersion at X, i. e., 
the past light cone with vertex p is an immersed submanifold near the point 
e~{X). This is, of course, necessarily the case for vectors X G C~n>V°, where 
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Wp denotes the domain on which the exponential map is a diffeomorphism. 
This reflects the well-known fact that the past light cone with vertex p is a 
3-dimensional manifold if we restrict to sufflciently short lightlike geodesics 
issuing fom p. For X G C~ \ Wp, however, the number m = 3 — rank(Txep ) 
may be bigger than 0. Comparison with the preceding subsection shows that 
this is the case if and only if the point e~{X) is conjugate to p along the 
geodesic generated by X. The number m is called the multiplicity of this 
conjugate point. Since, obviously, {Txe~){X) ^ 0, the multiplicity m may 
be either 1 or 2. It is important to realize that, in any case, the image of 
the differential Txe~ is a lightlike subspace. For a proof it suffices to realize 
that (2), (6) and (7) imply g{X',J) = 0, so the tangent vector of A and 
all “connecting vectors” with infinitesimally neighboring lightlike geodesics 
starting from the same point span a lightlike subspace. (Here we make use of 
the well-known fact that a timelike vector cannot be orthogonal to a lightlike 
vector, i. e., that the equations g{X',X') = 0 and g{X',J) = 0 imply the 
equation p( J, J) > 0 .) In particular, this proves the well-known fact that the 
light cone is a 3-dimensional lightlike submanifold at each point where Txe.p 
has maximal rank. 

The union of all points which are conjugate to p, along any past-pointing 
lightlike geodesic issuing from p, is called the past lightlike conjugate loeus of 
p or the caustic of the past light cone of p. In other words, the caustic is the 
set of all points where the past light cone fails to be an immersed submanifold 
of . At caustic points, the light cone typically forms edges or vertices whose 
geometry might be arbitrarily complicated. If one restricts to caustics which 
are stable against perturbations in a certain sense, then a local classification of 
caustics is possible with the help of Arnold’s singularity theory of Lagrangian 
or Legendrian maps, see Arnold, Gusein-Zade and Varchenko [3] or Arnold 
[2]. This formalism has been applied to wavefronts in general relativity, a 
notion which includes light cones as special cases, by Friedrich and Stewart 
[18], by Hasse, Kriele and Perlick [28] and, in a particularly elegant way, by 
Low [43]. (In [28] the proof of Theorem 4.4 is incorrect. A corrected version is 
going to appear.) In the case of globally hyperbolic spacetimes the formalism 
of Low even allows to tackle the problem of globally classifying the caustics 
of light cones, although this has not been carried through until now. For the 
sake of comparison the reader should also consult Fetters’ work [62] [64] [65] 
on caustics in the quasi-Newtonian approximation formalism of gravitational 
lensing. Unfortunately, the subject of classifying stable caustics is so technical 
that we cannot go into this matter here for lack of space. 

It is important to realize that a light cone may fail to be an embedded 
submanifold of Ai even if its caustic is empty. At the end of this subsection we 
shall illustrate this claim by an example where a light cone develops transverse 
self-intersections without ever failing to be an immersed submanifold of Ai, 
see Fig. 4 below. The relevant notion for finding out whether a light cone 
is an embedded submanifold is the notion of cut points, and not the notion 
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of conjugate points. To work this out, we have to take a closer look at the 
chronological past I~{p) of a point p, please recall Definition 2. 

In Minkowski space, the lightlike geodesics issuing from p into the past 
make up the boundary of X~(jp). In spacetimes with a complicated 

causal structure, however, those lightlike geodesics may penetrate into the 
open set I~{p), i. e., the past light cone oip may have a non- void intersection 
with X~(p). In spacetimes with drastic causality violations (such as, e. g., 
the Godel cosmos, see Hawking and Ellis [29], Sect. 5.7) X~{p) may even 
be all of Ai such that dX~{p) is empty and the past light cone of p is 
completely contained in X~ (j>). Quite generally, dX~ {p) can be characterized 
in the following way. 

Proposition 4. For any point p in a spacetime, the set dX~(jp) is either 
empty or a 3-dimensional achronal closed embedded C^~ submanifold of Ai. 
(A subset of a spacetime is achronal if it is impossible to connect any two 
of its points by a timelike curve. A C^~ manifold is a topological manifold 
whose transition maps satisfy a Lipschitz condition.) 

For a proof we refer to Hawking and Ellis [29], Proposition 6.3.1. With the 
Lorentzian distance function d defined by (8), we get the following result for 
a past-pointing lightlike geodesic A with A(so) = p. A point A(s) with s > So 
is in the boundary ofX~{p) if c?(p , A(s)) = 0 and it is in the open set X~{p) if 
d(p, A(s) > 0. Thus, the past cut point of p along a lightlike geodesic can be 
characterized as the point where this geodesic leaves the boundary of X~ (p) 
and penetrates into the open set X~{p). The set of all past cut points of p 
along lightlike geodesics through p is called the past lightlike cut locus of p. 
We shall now prove that in past-distinguishing spacetimes the failure of a 
past light cone to be an embedded submanifold is indicated by a non-empty 
past lightlike cut locus. 

Proposition 5. Assume that the spacetime {At, g,'X~^) satisfies the past- 
distinguishing property at a point p. If the past lightlike cut locus of p is 
empty, then the map e~ defined in (10) is a C°° embedding, i. e., the past 
light cone of p is an embedded C°° submanifold of At . 

Proof. If the past lightlike cut locus is empty. Proposition 2 implies that no 
past-pointing lightlike geodesic starting at p can have a point conjugate to p. 
Hence, the map e~ is a C°° immersion. Together with the past-distinguishing 
condition, the same assumption implies that such a geodesic must stay on 
dX~{p) forever, i. e., the past light cone of p must be completely contained 
in dX~{p). But then Proposition 4 guarantees that the past light cone has 
no self-intersection and no almost self-intersection. Hence, e~ must even be 
an embedding. □ 

In Sect. 4 below we shall prove that the converse of this proposition is true in 
globally hyperbolic spacetimes, see Proposition 15. - We now illustrate the 
properties of conjugate loci and cut loci with three examples. 
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Example 1: 

Figure 4 shows the past light cone of a point p in a spacetime with a non- 
transparent deflector. To have a concrete example, the reader may consider 
the spacetime metric 

g = -dt^ + dz^ + dr"^ + dp^ , (11) 

with some constant 0 < /c < 1, on x \ {0}). Here (t, z) denote 

Cartesian coordinates on R^ and (r, p) denote polar coordinates on R^ \ {0}. 
This can be interpreted as the spacetime around a static non-transparent 
string, see Vilenkin [79], Hiscock [32] and Gott [24]. (Vilenkin in his pioneer- 
ing paper discussed this metric in connection with the linearized Einstein 
held equation; it was then realized independently by Hiscock and Gott that 
Vilenkin’s results remain true even if the full Einstein equation is used.) One 
should think of the string as being situated at the z-axis. Since the latter is 
not part of the spacetime, it is indeed justified to speak of a non-transparent 
string. It is easy to see that the metric (11) induces on each plane t = const., 
z = const, the geometry of a cone; i. e., this metric has a “conic singularity” 
along the z-axis. 

It is an instructive exercise to verify that in this string spacetime each past 
light cone qualitatively looks like the one depicted in Fig. 4. (Glearly, in Fig. 
4 one spatial dimension is suppressed. This missing dimension corresponds 
to the z direction in the string example. The fat vertical line in Fig. 4 
actually indicates that a two-dimensional world sheet has been excised from 
spacetime, viz., the total history of the z-axis in the string example.) The 
caustic of this light cone is empty, i. e., there are no points conjugate to 
p along any past-pointing lightlike geodesic from p. The past lightlike cut 
locus of p, however, is not empty, thereby illustrating our earlier claim that a 
light cone may fail to be an embedded submanifold without failing to be an 
immersed submanifold. Moreover, Fig. 4 nicely exemplifies our general result 
that each past-pointing lightlike geodesic from p enters I~ (p) exactly when 
passing through the cut locus. 

Example 2: 

We now modify Example 1 by switching to a transparent deflector. In the 
case of the string metric (11) this can be done by changing the metric in the 
neighborhood of the z-axis in such a way that there is no longer a singularity, 
i. e., by “rounding off the tip of the cone” which represents each plane t = 
const., z = const. The region around the z-axis in which the metric has been 
changed can then be interpreted as the interior region of a transparent string. 
The resulting light cone looks like the one depicted in Fig. 5. The fact that 
now there are new lightlike geodesics (in comparison to Fig. 4) that pass 
through the interior region of the deflector gives rise to the formation of 
conjugate points, i. e., the light cone is no longer everywhere an immersed 
submanifold of A4 . More precisely, the light cone develops two cuspidal edges 
that meet in a so-called swallow-tail at the point denoted by q in the figure. 
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Fig. 4. In a spacetime with non-transparent deflector, e. g., a non-transparent 
string, light cones may develop self-intersections without failing to be immersed 
submanifolds. It is geometrically evident that the situation depicted here gives rise 
to double imaging for an observer at p . 



These two cuspidal edges (including the point q) make up the caustic of the 
light cone. The point q also belongs to the cut locus which looks quite similar 
to the one in Fig. 4. A special role is played by the past-pointing lightlike 
geodesic A starting from p that passes through the point q. (In Fig. 4 this 
geodesic was blocked by the deflector, so it did not reach a point analogous to 
q.) g is conjugate to p and, at the same time, the past cut point of p along A. 
All neighboring geodesics emanating from p pass through the cut point first 
(where they enter into X^{p)) and reach their first conjugate point afterwards 
(where they smoothly slip over the cuspidal edge). Again we emphasize that 
one spatial dimension is suppressed in Fig. 5. 

Example 3: 

By excising a neighborhood of the point q from the spacetime in Fig. 5 we 
are led back to a light cone of the kind considered in Fig. 4. It is more in- 
teresting to leave a neighborhood of q untouched and instead to remove a 
worldline from spacetime that intersects the lightlike geodesic A between p 
and q. (This may then be reinterpreted as the worldline of a non-transparent 
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deflector, although some adjustments are necessary to reconcile this new in- 
terpretation with Einstein’s field equation.) The interesting new feature of 
this modified example is that the geodesic A is blocked before it reaches the 
point q (contrary to Fig. 5) but that the point q is still part of the spacetime 
(contrary to Fig. 4). It is obvious that in this new situation the past light 
cone of p united with {p} is no longer a closed subset of M since q is in 
the closure of the light cone but does not belong to it. By the same token, 
the past lightlike conjugate locus and the past lightlike cut locus of p are no 
longer closed subsets of A4. In Sect. 4 we shall see that this is possible only 
in spacetimes that are not globally hyperbolic. 




Fig. 5. In a spacetime with a transparent deflector, e. g., a transparent string, fight 
cones typically fail to be immersed submanifolds. It is geometrically evident that 
the situation depicted here gives rise to triple imaging for an observer at p, cf. Fig. 

1 . 



These three examples are quite instructive but they are, of course, not 
universal in view of gravitational lensing situations. Completely new features 
may occur if the missing spatial dimension is taken into account, in particular 
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in spacetimes without symmetry. Also, causality violations or a non-trivial 
topological structure of spacetime (apart from a “hole” that is meant to 
model a non-transparent deflector) may change the global features of light 
cones dramatically. 



3.3 Citeria for Multiple Imaging 

We are now ready to turn to the discusssion of multiple imaging situations. 
To find out how many images an observer at p would see of a light source 
with worldline 7 we have to determine the intersection of 7 with the past 
light cone of p. The following proposition shows that in past-distinguishing 
spacetimes the occurence of cut points is necessary for multiple imaging. 

Proposition 6. Assume that the spacetime (M, g, T^) is past- distinguishing 
at a point p and that the past lightlike cut locus of p is empty. Then for every 
timelike C°° curve 7 : / — > A 4 there is at most one past-pointing lightlike 
geodesic that starts at p and terminates on 7. 

Proof. By contradiction, assume that we have two past-pointing lightlike 
geodesics Ai and A2 from p to 7. If they meet 7 at the same point. Proposi- 
tion 3 shows that the past lightlike cut locus of p cannot be empty. So let us 
assume that the two geodesics meet 7 in two different points Ai(si) = q\ ^ 
A2(s 2) = 92, with qi in the past of q2 (say). Then we can join the section of 
7 between q\ and 52 to A2 to get a past-pointing causal curve from p to q\ 
that is not a lightlike geodesic. By a well-known theorem (see Hawking and 
Ellis [29], Proposition 4.5.10) this implies that q\ is in I~{p). Together with 
the past-distinguishing assumption this makes sure that the past cut point 
of p along Ai must exist which gives the desired contradiction. □ 

A slightly weaker version of this result, assuming the strong causality condi- 
tion rather than the past-distinguishing condition, was given in Perlick [58]. 

Next we give a sufficient criterion for multiple imaging. The examples 
studied at the end of the preceding subsection suggest that a past light cone 
forms several sheets after past-pointing light rays have passed through cut 
points or conjugate points, and that this gives rise to multiple imaging situ- 
ations. The following proposition puts this general idea into precise form. 

Proposition 7. Fix, in an arbitrary spacetime (At, 5, T”'*’), a point p and a 
past-pointing lightlike geodesic A : / — > M with A(so) = p. Assume that, for 
some parameter si > So in I, A(si) is a conjugate point or the past cut point 
{or both) of p along A. Then, for every parameter value s G I with s > Si, 
there is a timelike curve 7 through A(s) that can be reached from p along at 
least two past-pointing lightlike geodesics. 

Proof. We first show that, for s > si, the point A(s) can be reached from p 
along a past-pointing timelike curve. If A(si) is the past cut point of p along 
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A, this follows directly from the definition of cut points. If it is a conjugate 
point, it follows from Proposition 1. Now we take such a timelike curve and 
perturb it slightly near p. In this way we get a timelike curve that intersects 
the past light cone of p in A(s) and in another point close to p, so it can be 
reached from p along two different past-pointing lightlike geodesics. □ 

Together with Proposition 5 this result implies that multiple imaging takes 
place whenever a past light cone fails to be an embedded submanifold. This 
is true, in particular, whenever a past light cone forms a caustic. 

Proposition 7 can be used to prove that multiple imaging occurs in large 
classes of spacetimes. E. g., it is well known that, under conditions which are 
to be considered as fairly general from a physical point of view, a lightlike 
geodesic must be either incomplete or contain a pair of conjugate points. 
Those “fairly general conditions” are, e. g., the weak energy condition and the 
socalled generic condition. We do not want to go into this matter here. We just 
mention that results of this kind have played a crucial part in the development 
of the Penrose-Hawking singularity theorems and we refer to the detailed 
discussion in Hawking and Ellis [29], in particular to Proposition 4.4.5. We 
also mention that the weak energy condition need not hold pointwise but that 
some integrated version of the weak energy condition would do, see Tipler 
[77], Horde [ 6 ], Roman [67] and Kannar [34]. In view of these results it seems 
justified to say that the occurence of conjugate points along lightlike geodesics 
is the rule rather than the exception. But then Proposition 7 implies that the 
occurence of multiple imaging is the rule rather than the exception. 

One has to keep in mind that the worldline 7 in Proposition 7 must be 
constructed in a particular way. There is no guarantee that the real universe 
for which {A4,g) is a mathematical model contains a real light source (i. e., 
a galaxy or a quasar) that travels on this worldline 7 . Therefore it would 
be nice to have an analogous proposition in which both the point p and the 
worldline 7 are to be prescribed. Such a proposition holds in globally hyper- 
bolic spacetimes and will be proven in Sect. 4, see Proposition 17 below. 

Proposition 7, which is a fairly simple corollary of standard theorems, was 
given in Perlick [58]. Already earlier, Padmanabhan and Subramanian [51] 
had shown that the existence of conjugate points along a lightlike geodesic 
is sufficient for multiple imaging. However, their proof is completely different 
from ours and it uses a lot of additional assumptions on the topological and 
causal structure of spacetime most of which slip in surreptitiously. On the 
basis of these additional assumptions, Padmanabhan and Subramanian [51] 
were also able to show that the existence of conjugate points along lightlike 
geodesics is necessary for multiple imaging. We have already emphasized that 
this is not true in arbitrary spacetimes, please recall Example 1 at the end 
of the preceding subsection. We are now going to investigate topological and 
causal conditions on the spacetime that allow to prove such a result. 

Example 1 might suggest that in simply connected spacetimes multiple 
imaging situations without conjugate points cannot occur. A more careful 
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analysis shows that it is not the topology of (4-dimensional) spacetime but 
rather the topology of (3-dimensional) space that matters. To make this no- 
tion precise we have to consider a timelike C°° vector field V {observer field) 
on A4. The existence of such an observer field is guaranteed on all of Ai 
owing to the time-orientability assumption (c) of Definition 1, cf., e. g., Wald 
[80], Lemma 8.1.1. With such a V chosen, we may call any two points of 
M equivalent if they lie on a common integral curve of V. The correspond- 
ing quotient space, equipped with the quotient topology, will be denoted by 
Sv and can be interpreted as the space with respect to the observer field 
V. Please note that Sv need not satisfy the Hausdorff axiom; as a counter- 
example one may consider any timelike vector field on Minkowski space with 
one point removed. Also, there is no guarantee that admits a smooth 
manifold structure such that the natural projection Try : M — > Sv becomes 
a submersion; as a counter-example one may consider a timelike vector field 
V with an integral curve that is almost periodic. For later purpose we state 
the following result. 

Proposition 8. Let (Ai, g, be a spacetime that does not contain a closed 
timelike curve and let V be a timelike C°° vector field on Ai. If the quotient 
space Sv satisfies the Hausdorff axiom, Sv admits a C°° structure such that 
the natural projection Try : Ai — > Sv makes Ai into a fiber bundle over Sv 
with typical fiber diffeomorphic to R. 

For a proof we refer to Harris [27], Theorem 2. Note that Harris’ assumption 
of V being complete is unnecessary since every nowhere vanishing vector 
field on Ai can be made into a complete vector field by multiplication with an 
appropriate positive function. To prove this, one puts a complete Riemannian 
metric h on Ai. This is possible since, by a famous theorem of Whitney [86] 
(see also, e. g., Hirsch [31], p. 55) every n-dimensional paracompact manifold 
can be smoothly embedded as a closed submanifold into pulling back 

the Euclidean metric gives the desired complete Riemannian metric. It is 
then easy to check that the vector field h{V, V is complete, cf., e. g., 

Abraham and Marsden [1], Proposition 2.1.21. 

For the following consideration we only need the topological structure on 
Sv- We define for any point p € Ai the set Sy, called the space visible to p 
with respect to V, in the following way. We say that a point in Sv is in Sy if 
and only if the integral curve of V which is represented by that point either 
passes through p or can be reached from p along a past-pointing lightlike 
geodesic in Ai. We are now ready to formulate the desired proposition. 

Proposition 9. Choose a timelike C°° vector field V on an arbitrary space- 
time {Ai, g,T^). Fix a point p € Ai and assume thatSy, the space visible to 
p with respect to V , is simply connected. If the past lightlike conjugate locus 
of p is empty, any integral curve of V can be reached from p along at most 
one past-pointing lightlike geodesic. 
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Proof. As the past lightlike conjugate locus of p is empty, the map e~ of (10) is 
an immersion. Then its image, the past light cone of p, is an immersed lightlike 
submanifold of A4. Since a timelike vector cannot be tangent to a lightlike 
submanifold, each integral curve of V intersects the image of e~ transversely. 
Hence, the combination of e~ with the projection Try : A4 — > Sy gives 
a homeomorphism locally around each point, i. e., it gives a covering map 
from C~ onto Sy. As a covering map onto a simply connected space must 
be a (global) homeomorphism, no integral curve of V can intersect the past 
light cone of p more than once and the past light cone of p cannot have self- 
intersections or almost self-intersections. The latter implies that e~ is even 
an embedding, i. e., it is impossible that a point can be reached from p along 
two different past-pointing lightlike geodesics. □ 

The condition of Sy being simply connected prohibits, in particular, situ- 
ations such as in Example 1 where a non-transparent deflector is modeled 
by a hole. However, it may also be violated in situations with transparent 
deflectors, viz., if the visible universe has a non-trivial spatial topology. 

For some situations of interest, at least. Proposition 9 says that multi- 
ple imaging requires the occurence of conjugate points. This is a valuable 
result since the existence of conjugate points along a lightlike geodesic allows 
to estimate the Ricci tensor along that geodesic. If we take Einstein’s held 
equation into account, this estimate of the Riccci tensor can be rewritten 
as an estimate on the energy density, see Padmanabhan and Subramanian 
[51]. It is this observation that makes it physically interesting to investigate 
whether in a multiple imaging situation conjugate points must occur. 

We summarize the results found sofar in the following way. The occurence 
of conjugate points or of cut points along a past-pointing lightlike geodesic is 
always sufficient for multiple imaging. If the past-distinguishing condition is 
satisfied at the observer’s position, the occurence of cut points is necessary as 
well. If the space visible to p (with respect to an observer held) has a simply 
connected topology, then the occurence of conjugate points is also necessary. 

We end this subsection with a group of propositions characterizing the 
special situation that a worldline 7 meets the caustic of the past light cone 
of p. We first show that this is an exceptional situation. 

Proposition 10. Let 7 : / — > M. he a timelike C°° eurve in an arbitrary 
spaeetime {M, g,T^). Then the set of all points p € M such that 'f does not 
meet the caustic of the past light cone of p is dense in M.. 

Proof. For each p in some open subset U C M, we consider the map e~ of 
(10) and identify its domain C~ with R^\{0}. This can be done with the help 
of local coordinates in the tangent bundle. Then the assignment p 1 — > e~ 
gives a continuous embedding from U into the space C^(R^ \ {0}, Al) of 
maps from \ {0} into A4, equipped with the weak (or compact-open) 
topology. For the definition of this topology we refer to Hirsch [32], p. 34. By 
the transversality theorem (see, e. g., Hirsch [32], Theorem 2.1), the maps 
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which are transverse to 7 form a dense subset of \ {0}, Al). Thus, the 

points p for which e~ is transverse to 7 are dense in U. Please recall that, 
by definition, e~ is transverse to 7 at a point q = e~(X) if either g ^ 7 or 
the image of Txe~ and the tangent space of 7 span all of TqM. Clearly, if 7 
meets the caustic of the past light cone of p at e~{X), transversality cannot 
be satisfied since the image of Txe~ is at most two-dimensional. □ 

In other words, by a “small perturbation” of the point p we can always achieve 
that a given worldline 7 stays away from the caustic. But then multiple 
imaging situations are restricted by the following result. 

Proposition 11. If, in an arbitrary spacetime {M, a timelike C°° 

curve 7 : I — > A 4 does not meet the eaustic of the past light eone of a point 
p, then there are at most denumerably many past-pointing lightlike geodesies 
that start at p and terminate on 7. 

Proof. Consider the pre-image Ap^-f = (e“)“^(7) of 7 under the map (10). 
If 7 does not meet the caustic, e~ is an immersion at each point X £ Ap^~^, 
i. e., it maps a neighborhood of X in C~ onto a 3-dimensional submanifold 
which is lightlike and, thus, transverse to 7. This proves that the points in 
Ap^-./ are isolated, i. e., that there are only finitely many in each compact 
subset of C~ . Since C~ ~ \ {0} can be covered with denumerably many 

compact sets this completes the proof. □ 

These two propositions justify our earlier claim that for multiple imaging 
situations Case C is generic and Case D is exceptional. However, again we 
emphasize that these results crucially depend on our idealization of assuming 
a pointlike source. They are, of course, no longer true if the worldline 7 is 
replaced with a worldsheet or a worldtube. 

Proposition 11 implies that a Case D situation is possible only if 7 meets 
the caustic of the past light cone of p. In other words, if this light cone does 
not develop a caustic, then it is impossible for the observer at p to see an 
extended image such as an arc or a ring. We end this section by proving 
our earlier claim that, in a Case D situation, all parts of an extended (and 
connected) image show the light source at the same age. 

Proposition 12. Let p be a point and 7 : I — > A 4 a timelike C°° curve in 
a spacetime {A 4 , g,T~^). Let Ap^-^ denote the pre-image of j under the map 
e~ which was introduced in (10). Then each connected component of Ap^-y is 
mapped by e~ onto a single point. 

Proof. Assume the image is not a single point. Then, by continuity of the 
exponential map, it is a one-dimensional timelike submanifold, viz., a portion 
of 7. This contradicts the fact that the image of Txe~ is always lightlike, so 
the light cone cannot contain a timelike curve. □ 
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3.4 Fermat’s Principle 

For many applications it is useful to characterize the lightlike geodesics be- 
tween a point and a timelike curve in a spacetime as the solutions of a varia- 
tional problem. There are several versions of such a variational problem which 
may be viewed as general-relativistic generalizations of the traditional Fermat 
principle. The oldest versions, which hold on static or stationary spacetimes 
only, date back to Weyl [84] and Levi-Civita [41]. They are also discussed 
in several modern text-books and review articles, see, e. g., Frankel [17] or 
Straumann [76] for the static case and Landau and Lifschitz [40] or Brill [ 8 ] 
for the stationary case. For a discussion from a mathematical point of view 
we refer to Masiello [44]. Here we want to present a more general version 
of Fermat’s principle which holds on arbitrary spacetimes. Its formulation is 
due to Kovner [36] and the proof that the solution curves of this variational 
problem are, indeed, the lightlike geodesics was given by Perlick [55]. The 
same version of Fermat’s principle is also discussed in Schneider, Ehlers and 
Falco [70]. As an aside, we mention that this version of Fermat’s principle 
may be generalized to the case of light rays in media, see Perlick [59] for a de- 
tailed exposition, and to the case of extended (i. e., non-pointlike) observers 
and light sources, see Perlick and Piccione [60]. According to the framework 
of this article we shall not discuss these generalizations here. 

It is our goal to characterize, in an arbitrary spacetime (A4, the 

lightlike geodesics from a point p to a timelike worldline 7 by a variational 
principle. To that end we have to specify (i) the set of trial curves, i. e., the 
set of curves among which the solutions to the variational problem are to 
be sought, and (ii) the functional that is to be extremized. The set of trial 
curves, denoted by henceforth, is defined in the following way. 

Definition 4. Fix, in an arbitrary spacetime (At, g, T^), a point p € A4 and 
an embedded timelike C°° curve 7 : / — ^ A4. Then is, by definition, 
the set of all C°° immersions A : [0, 1] — > Ai with the following properties. 

(a) A is lightlike, i. e., p(A'(s), A'(s)) = 0 for all s G [0, 1]. 

(b) A starts at p and terminates on 7 , i. e., A(0) = p and there is a r(A) € I 
such that A(l) = 7 (t(A)) . Since 7 is assumed to be an embedding, this defines 
a unique assignment A 1 — > t(A). 

(c) p(A'(l), 7 '(r(A))) < 0, where t(A) G / is defined through (b). 

Roughly speaking, the space of trial curves can be characterized as the set of 
all ways to go from p to 7 at the speed of light. Our decision to define all trial 
curves on the interval [0, 1] is a matter of convenience only. Condition (c) of 
Definition 4 restricts to future-oriented or past-oriented curves, depending on 
whether 7 is future-oriented or past-oriented. For applications to gravitational 
lensing we are interested in the case that 7 is a past-pointing parametrization 
of the worldline of a light source, see Fig. 1. 

Condition (b) of Definition 4 defines a map r : — >■ M. We refer to 

T as to the arrival time functional henceforth. This will be the functional to 
be extremized. - Finally, we need the following definition. 
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Definition 5. For a curve A G as defined in Definition 4, a C°° varia- 
tion of A in is a C°° map 77 : ] — £To, £o[ x [0, 1] — M, for some £0 > 0, 
such that 77(0, • ) = -^ 77(e, • ) G for all e G ] — £o,£o[ ■ The vector 

field X : [0,1] — > TXi , s 1 — > X(s) = (77 ( • ,s))^(0) is called the variational 
vector field of rj . X is called non-trivial if X(s) and A'(s) are non-collinear 
for some s G [0, 1] . 

We want to prove that among all trial curves the lightlike geodesics are the 
stationary points of the arrival time. To that end we shall need the following 
characterization of variational vector fields. 

Lemma 1. For a C°° vector field X : [0, 1] — > TXi along A G Cffiy, the 
following two properties are equivalent. 

(a) X is the variational vector field of a C°° variation r] of X in 

(b) 3 (VaW,A') = 0, X(0) = 0, a77dX(l) II y(r (A)). 

The implication “(a)=i>(b)” is obvious since the desired properties of X follow 
just by differentiating the defining properties of trial curves. For a proof of 
the converse implication, which is more cumbersome, the reader is refered to 
Perlick [55]. 

We are now ready to formulate and prove the general-relativistic Fermat 
principle. 

Theorem 1. (Fermat’s principle) Let g,'T~^) be an arbitrary spacetime, 
fix a point p G M and an embedded timelike C°° curve 7 : / — > Xi . Then 
for a trial curve A G the following two properties are equivalent. 

(a) X is a geodesic or a reparametrization thereof. 

(b) For all C°° variations p of X in the equation ^T(r]{£, = 0 

holds true. 



Proof. We first observe that the definition of the arrival time r implies the 
equation p{e, 1 ) = 7(r(77(e, • ))) for each variation 77 of A in Differ- 
entiating with respect to e and setting £ = 0 yields 



X{1) = 7 (r(A)) -^t{p{£, •)) 



( 12 ) 



We now prove the implication “(a)=^(b)”. By assumption, there is a C°° 
function w : [0,1] — >■ K such that X/yX' = w X' . Now let X be the variational 
vector field of a variation p of X in Then we find wg{X,X') = 

g{X, Va' A') = g{X, A')' — g{Xx'X, A') . The last term vanishes by Lemma 1. 
Upon integration, we find 



g(X(l),A'(l)) 



5(X(0), A'(0)) exp 




(13) 



Since X(0) = 0, this implies g(X(l), A'(l)) = 0. But then the desired re- 
sult can be read from (12) because the timelike tangent vector to 7 can- 
not be orthogonal to the lightlike tangent vector to A. - To prove the con- 
verse implication “(b)=>(a)” we define a vector field U\ : [0,1] — >■ TXi 
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by parallel transporting the vector 7 '(t(A)) along A, i. e., V\'U\ = 0 and 
t/A(l) = 7 '(t(A)). Clearly, U\ is everywhere timelike, so g{X',U\) has no ze- 
ros. Now let Z : [0, 1] — > TM. be any C°° vector field along A with Z{0) = 0 
and Z(l) = 0. We use this Z to define a new vector field X along A by 



X(s) = Z(s) 



ff(VyZ,A') 

g{Ux,\') 




(14) 



With the help of Lemma 1 it is easy to verify that X is the variational vector 
field of a variation ry of A in Hence, we can read from (12) that 
our hypothesis implies ^(1) = 0. As Z(l) = 0 and (/(?7a(1), A'(l)) ^ 0, the 
integral in (14) must vanish for s = 1. Upon integration by parts, this results 
in 




Z,Vx' 



y 

g{Ux,y) 



) 



ds = 0 . 



(15) 



Since Z was an arbitrary C°° vector field along A with Z{0) = 0 and Z{1) = 0, 
the fundamental lemma of variational calculus implies that 



A' 



g{Ux,y) 



= 0 , 



(16) 



i. e., that Va'A' is a multiple of A'. 



□ 



Theorem 1 can be phrased as saying that among all ways to go from p to 7 at 
the speed of light, the light rays are characterized as the stationary points of 
the arrival time t. The analogy to the traditional Fermat principle is obvious. 
For some applications it might be convenient to choose 7 as parametrized 
by proper time, g{j ' = —1. However, Theorem 1 is true for any other 
smooth parametrization as well. The arrival time functional r : — > R 

changes, of course, if the parametrization is changed, but the new arrival time 
functional has the same stationary points as the old one. 

Please note that, by Theorem 1, a light ray may be a local minimum, a 
local maximum or a saddle of r. We shall see in the next subsection that, 
actually, local maxima do not occur. 

From Theorem 1 we can easily rederive the more special versions of Fer- 
mat’s principle which are given in many text-books on general relativity. 
To illustrate this claim we consider the special case of a conformally static 
spacetime. More precisely, we need the following assumptions. 

(a) Xi is diffeomorphic to 5 x R, with some 3-dimensional manifold S. 

(b) The metric takes the form 

g = dx^ ® dx" — dt®dt)j (17) 

where t denotes the projection from A4 ~ 5 x M onto the second factor, 
X = (a;^, x^ ,x^) are coordinates on S and the Einstein summation convention 
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is used for greek indices running from 1 to 3 . (Coordinates on S are used for 
notational convenience only. It will not be necessary to assume that S can 
be covered by a single coordinate system.) 

(c) toy is constant, i. e., 7 is vertical with respect to the product structure 
of ~ 5 X R. 

In this situation the spacetime geometry is time independent up to an over- 
all factor and the worldline 7 is at rest in the “space” S. Fermat’s 

principle takes its simplest form if we use for 7 a parametrization adapted to 
t, i. e., dt{'y') = 1. Since all trial curves are lightlike, we can then read from 
(17) that the arrival time functional is given by 

rW = /' ^ ( 18 ) 

Jo 

Here s 1 — >■ a;(s) denotes the projection onto the first factor of s 1 — >■ A(s) € 
M ~ 5 X R. By (18), r(A) is exactly the length of the projected curve 
s I — > x{s), measured with the spatial metric h = dx^ 0 dx'^ . Hence, 

in this special case Theorem 1 says that a trial curve is a light ray if and 
only if its projection to S traces out an h-geodesic. This result is due to Weyl 
[84], apart from the fact that Weyl restricted to the static case, i. e., he did 
not allow the function / in (17) to depend on t. There is a straightforward 
generalization from the (conformally) static to the (conformally) stationary 
case which is essentially due to Levi-Civita [41]. The solution curves are then 
no longer h-geodesics but modified by a kind of Coriolis force. For a detailed 
discussion, including several examples, the reader is refered to Perlick [56]. 

3.5 Morse Index Theory for Fermat’s Principle 

Fermat’s principle admits several interesting applications to gravitational 
lensing. E. g., Schneider [69] has shown that Fermat’s principle can be used 
in the derivation of the so-called lens equation of the quasi-Newtonian ap- 
proximation formalism, see also Schneider, Ehlers and Falco [70]. Again in 
the quasi-Newtonian approximation, Blandford and Narayan [5] have used 
Fermat’s principle to give a topological classification of images. Owing to 
the approximation assumptions, in these situations it suffices to consider 
Fermat’s principle on conformally static spacetimes. An application to grav- 
itational lensing of Theorem 1 where the conformally static or conformally 
stationary version would not do was worked out by Kovner [36]. He consid- 
ered a gravitational wave sweeping over a gravitational lensing situation and 
calculated, to within certain approximations, the influence of the wave on 
the arrival times and on the positions of the images at the observer’s sky. 
This line of thought was further developed by Faraoni [14] who assumed the 
spacetime to be a first order (but non-stationary) perturbation of Minkowski 
space and used a coordinate version of Fermat’s principle, given in Perlick 
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[55] , to calculate integral formulae for the arrival time and for the deflection 
angle. 

Here we want to discuss a different application of Fermat’s principle to 
gravitational lensing. The basic idea is to formulate a Morse theory for Fer- 
mat’s principle, in analogy to the classical Morse theory of Riemannian ge- 
ometry, and to investigate the significance of the Morse relations in view of 
gravitational lensing. As a first step towards this goal, we establish a Morse 
index theorem for Fermat’s principle, thereby investigating whether a solu- 
tion curve yields a local minimum, a local maximum or a saddle of the arrival 
time functional. A full Morse theory has to presuppose a globally hyperbolic 
spacetime and will be the subject of Sect. 4.2 below. 

As a preparation, it is certainly useful to recall the classical Morse index 
theory of Riemannian geometry which was developed by Morse [48] in the 
1930s. Let p and q be two points in a Riemannian manifold (Af, h), i. e., 
in a manifold with a positive definite metric. Then, among all sufficiently 
regular curves a : [0, 1] — Af with a(0) = p and a(l) = q, the geodesics 
are characterized as the stationary points of the energy functional E(a) = 
/p h(^a' (s) , a' (s)) ds . To find out whether a geodesic a is a local minimum, a 
local maximum or a saddle of E one has to calculate the Hessian HesSa(if) of 
E at the point a (i. e., the “second variation”) and to determine the index and 
the extended index of HesSo,(£'). Please recall that the index (or the extended 
index, respectively) of a bilinear form is the maximal dimension of a subspace 
on which this bilinear form is negative definite (or negative semi-definite, 
respectively). The classical Morse index theorem says that HesSo,(if) is non- 
degenerate if and only if the endpoint a(l) is not conjugate to the initial point 
a(0) along the geodesic a and that the extended index of HesSo,(i5) is equal 
to the number of points o;(s) , s G ]0, 1] , that are conjugate to a(0) along a. 
Here each conjugate point is to be counted with its multiplicity. Based on the 
Morse index theorem, Morse was able to establish a number of theorems, now 
summarized under the name of Morse theory, to the effect that the number 
of geodesics joining two points p and q in a complete Riemannian manifold is 
related to the topology of the space of sufficiently regular curves joining these 
two points. Morse proved these results by considering the energy functional on 
the finite-dimensional space of broken geodesics with N breakpoints between 
Pi and P2, and then letting N ^ oo. For a detailed review of Morse’s work we 
refer to Milnor [46] . Later Palais and Smale [52] [53] brought forward a fresh 
approach to Morse theory by considering functionals on infinite-dimensional 
Hilbert manifolds. It was then no longer necessary to approximate the space 
of trial curves by A^-dimensional spaces and to consider the limit N ^ oo 
afterwards. It is this Palais-Smale version of Morse theory we want to apply 
to Fermat’s principle. 

It should be mentioned that a Morse index theory for the geodesic varia- 
tional problem (i. e., extremizing the energy functional between two points) 
exists not only for geodesics in Riemannian manifolds but also for timelike 
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and lightlike geodesics in Lorentzian manifolds, see Beem, Ehrlich and Easley 
[4] for a detailed exposition. However, this is not what we are interested in. 
We want to characterize lightlike geodesics between a point and a timelike 
curve (not between two points), and the functional we are going to extremize 
is the arrival time (not the Lorentzian analogue of the energy functional). 

The following exposition closely follows Perlick [57]. It is our goal to estab- 
lish a Morse index theorem for Fermat’s principle in an infinite-dimensional 
Hilbert manifold setting a la Palais-Smale. To that end we have to modify 
Fermat’s principle, as it was given in Theorem 1, a little bit. First, we observe 
that, according to Definition 4, all trial curves A G are of class C°°. It 
is well known that C°° maps from one manifold into another do not form 
a Hilbert manifold but, at the very best, a Frechet manifold. Since this is 
too weak for applying Morse theory, we shall replace the C°° condition on 
the trial curves by a Sobolev H'" condition in order to get a Hilbert mani- 
fold. Second, we observe that the arrival time functional r is invariant under 
reparametrization. As a consequence, its Hessian at a solution curve A is al- 
ways degenerate because it vanishes on the infinite dimensional vector space 
of trivial variational vector fields (please recall Definition 5). We shall solve 
this problem by imposing a parametrization fixing condition upon the trial 
curves. 

To work this out we have to introduce the Hilbert manifold of Sobolev 

curves. For background material on this subject the reader is refered to 
Schwartz [71]. The same book also contains a review of the Palais-Smale 
version of Morse theory. For /i, /2 G C"’” ([0, 1], K") , we define 



< /i I /2 >r — 



fi\s)-f2\s)ds 

i=odo 



(19) 



where denotes the i-th derivative of /i and the dot denotes the standard 
scalar product in M”. It is easy to check that this scalar product makes 

([0, 1], K”) into a real pre-Hilbert space. The completion of this pre- 
Hilbert space is, by definition, the Sobolev space iJ’'([0, 1],M"). For r = 0 this 
gives the real Lebesgue space L^([0, 1], K.") whose complex version is known 
to every physicist from quantum mechanics. For integers r > 1, iJ’’([0, 1], R”) 
can be (and will be henceforth) identified with the space of all maps 

from [0, 1] into R" whose r-th derivatives exist almost everywhere and are 
locally square integrable. 

Now we introduce the notion of curves in a manifold. Let A4 he a 
real, finite-dimensional C°° manifold whose topology satisfies the Hausdorff 
axiom and the second countability axiom. Then we define 



H'-([0,1],M) = {a: [0,1] 



joAg7J’'([0,1],R")} 



( 20 ) 



where j : A4 — >■ R" is a C°° embedding. A well-known theorem of Whitney 
[86] (see also, e. g., Hirsch [31], p. 55) guarantees the existence of such an em- 
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bedding for n > 2dim(Al) + l . It is easy to show that the set l],M) is 

independent of which j has been chosen. Moreover, it is a well-known result of 
Palais and Smale [53] that the inclusion map i?’'([0, l],Al) — iJ’’([0, 1], K") 
induced by j makes ^^([0, 1], Al) into a C°° submanifold of the Hilbert 
space i?’’([0, and that the manifold structure thereby established on 
1], Al) is, again, independent of j. Thus, we may view H’’([0, 1], Al) 
as an infinite dimensional real C°° Hilbert manifold in its own right. 

To define the modified space of trial curves we restrict the original space 
of Definition 4 by a parametrization fixing condition. For A € 
we define a vector field U\ : [0, 1] — TAi by parallel transporting the 
vector 7 ^('t(A)) along A, as in the proof of Theorem 1. Then the condition 
g{U\, A') = const, singles out exactly one parametrization for each trial curve. 
Please note that this condition singles out an affine parametrization along 
each geodesic. Now we define the modified space of trial curves in the following 
way. 

Definition 6 . Fix a point p and a timelike embedded C°° curve 7 : I — Ai. 
Then the space is, by definition, the set of all A G iJ^([0, 1], Ad) with 
the following properties. 

(a) 5(A',A')=0. 

(b) A(0) = p and there is a r(A) G / such that A(l) = 7 (t(A)) . 

(c) g{U\, A') = const. < 0 . 

It can be shown that Hp-^ is, indeed, an infinite dimensional Hilbert sub- 
manifold of H^([0, 1], At) and that the arrival time functional r : — >■ R 

defined by (b) is a C°° map. For a proof of these facts we refer to Perlick [57]. 
This result remains true if the condition in Definition 6 is replaced with 
an condition for r > 2 ; it is not true, however, for r = 1. Now Fermat’s 
principle, i. e.. Theorem 1, can be reformulated in the following way. 

Theorem 2. A curve A G is a geodesic if and only if the differential of 
the arrival time functional t : — > R has a zero at A. 

The proof, which is worked out in Perlick [57], is a straightforward translation 
of the proof of Theorem 1 into an setting. 

For the Morse index theorem we have to calculate the Hessian HessA(r) 
of T at a geodesic A. We find the following result. 

Theorem 3. (Morse index theorem) Let A G ^ be a geodesic. The index 
o/HessA(T) is equal to the number of points A(s) , s G ]0,1[, that are con- 
jugate to A(0) along A. The extended index o/HessA(T) is equal to the number 
of points A(s) , s G ]0,1] , that are conjugate to A(0) along A. In both cases 
conjugate points are to be counted with their multiplicities. 

The proof of this theorem is given in Perlick [57]. The strategy of this proof 
is to relate the second variational formula for Fermat’s principle to the sec- 
ond variational formula for the geodesic variational problem. For lightlike 
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geodesics, the latter is worked out in Beem, Ehrlich and Easley [4], Chap. 10. 
We have already emphasized the differences between these two variational 
problems. Nonetheless, their second variational formulae turn out to be es- 
sentially the same. Thereby, the proof of Theorem 3 comes as a corollary of 
the Morse index theorem proven in Beem, Ehrlich and Easley. 

Theorem 3 has the following immediate consequences. 

(a) HessA(T) is non-degenerate if and only if A(l) is not conjugate to A(0) 
along A. 

(b) The index and the extended index of Hess a (r) are finite for all geo- 
desics A G . Hence, A cannot be a local maximum of r. 

(c) A geodesic A G is a strict local minimum of t if and only if A 
does not contain a point conjugate to A(0) 

(d) A geodesic A G is a saddle of r if it contains a point A(s) which 
is conjugate to A(0) for some s G ]0, 1[ . 

In view of gravitational lensing, the index has the following interpretation. 
At each conjugate point, infinitesimally neighboring light rays “cross over” 
from one side of A to the other. This is associated with a side-reversion of 
the image. Thus, light rays with an even index yield a mirror image of those 
with an odd image. This is observable if our pointlike light source (e. g., the 
core of a galaxy) is surrounded by some non-symmetrical structure (e. g., 
irregular lobes or jets). 

In Sect. 4.2 below we use the Morse index theorem to develop a full 
Morse theory for light rays joining a point and a timelike curve in a globally 
hyperbolic spacetime and we discuss applications to gravitational lensing. 

4 Gravitational Lensing 

in Globally Hyperbolic Spacetimes 

We have seen in the preceding section that the geometry of multiple imaging 
situations is strongly influenced by the topological and causal structure of 
spacetime. Such global effects are usually ignored in the astronomical litera- 
ture on gravitational lensing where, typically more implicitly than explicitly, 
the deflector is assumed to be embedded in a universe without topological or 
causal pathologies. 

In this section we get somewhat closer to the standard astronomer’s point 
of view by restricting to spacetimes without causal pathologies. More pre- 
cisely, we are going to consider spacetimes that are globally hyperbolic ac- 
cording to the following definition. 

Definition 7. For a spacetime (AI,g,T''"), a subset 5 of A4 is called a 
Cauchy surface if each inextendible causal curve in M intersects S in exactly 
one point. A spacetime is called globally hyperbolic if it admits a Cauchy 
surface. 
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The name “globally hyperbolic” was introduced by Leray [39] in 1952. It 
refers to the fact that a global existence and uniqueness theorem for hy- 
perbolic partial differential equations can be established only on spacetimes 
with this property. Actually, our Definition 7 of global hyperbolicity does not 
coincide with Leray’s original definition, but it is well known that the two 
definitions are equivalent, see, e. g., Wald [80], p. 209. Basic properties of 
globally hyperbolic spacetimes are also reviewed in Hawking and Ellis [29], 
in O’Neill [50] and in Beem, Ehrlich and Easley [4]. 

One can show that a Cauchy surface 5 is a 3-dimensional topological 
submanifold of Ai, see, e. g.. Theorem 8.1.3 and Theorem 8.3.1 in Wald [80]. 
Mimicking the proof of Proposition 6.3.1 in Hawking and Ellis [29], one may 
even show that 5 is a (i. e., Lipschitz) submanifold of A4. In general, a 
Cauchy surface will not be a submanifold of Ai. 

In Sect. 3.3 we have introduced, for each timelike C°° vector field V on 
Ai, the quotient space Sy = Aij,^, where two points in Ai are considered 
equivalent if they can be connected by an integral curve of V . We shall now 
use Proposition 8 to show that in a globally hyperbolic spacetime Sy comes 
not only with a topological but even with a differentiable structure. To that 
end, we first observe that the existence of a Cauchy surface makes sure that 
there are no closed timelike curves in Ai. Since every integral curve of V 
intersects a Cauchy surface S exactly once, the restriction of the natural 
projection 



-Ky-.Ai — ^ (21) 

to S gives a homeomorphism from S onto Sy. Since S, being a topologi- 
cal submanifold of a Hausdorff space, must satisfy the Hausdorff axiom, this 
proves that Sy satisfies the Hausdorff axiom. By Proposition 8, there is a 
structure on Sy such that TVy : Ai — Sy makes Ai into a fiber bundle 
over Sy with fiber diffeomorphic to M. This argument implies that any two 
Cauchy surfaces in M. must be homeomorphic and that, for any two time- 
like vector fields V and V on M. the quotient manifolds Sy and Syi 
must be homeomorphic. According to a famous theorem of Moise [47] any 3- 
dimensional topological manifold admits exactly one differentiable structure. 
Hence, iSy and Syi must even be diffeomorphic. 

Geroch [20] has established the important fact that every globally hyper- 
bolic spacetime admits a continuous function t : M. — > M such that the set 
is a Cauchy surface for each to € M. It is widely believed that such 
a Cauchy time function t can be chosen differentiable, employing a smooth- 
ing argument of Seifert [72]. However, the details of the proof have never 
been worked out completely, and several dedicated experts who tried to do 
so failed. In any case, every globally hyperbolic spacetime admits a continu- 
ous Cauchy time function t which can be combined with the C°° projection 
TTy determined by a timelike C°° vector field V to give a homeomorphism 
(tt, t) : Ai — >■ Sy X R. Hence, the topology of a globally hyperbolic spacetime 
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is determined by the topology of any of its Cauchy surfaces. Note, however, 
that 5i X R may be homeomorphic to 52 x R without 5i being homeomorphic 
to 52- Therefore, the topology of a globally hyperbolic spacetime does not 
determine the topology of its Cauchy surfaces. E. g., Newman and Clarke 
[49] contrived a spacetime with topology that admits a Cauchy surface 
which is not homeomorphic to R^. 

It is a matter of debate whether the assumption of global hyperbolicity 
is to be considered as “reasonable” from a physical point of view. It is cer- 
tainly true that most physicists consider the validity of a global existence and 
uniqueness theorem for wave equations as a requirement any “reasonable” 
spacetime should satisfy. Moreover, only globally hyperbolic spacetimes arise 
from globally solving the initial value problem of Einstein’s (vacuum) field 
equation. From the viewpoint of global Lorentzian geometry, however, global 
hyperbolicity is a very strong assumption. In particular, any globally hyper- 
bolic spacetime has to satisfy, at each point p, the strong causality condition 
and, thus, the distinguishing conditions and the causality condition, recall 
Definition 3. Also, it is easy to verify that removing a point, a worldline or a 
worldtube from any spacetime necessarily results in a spacetime that is not 
globally hyperbolic. This remark is important for gravitational lensing where 
non-transparent deflectors are modeled by excising worldlines or worldtubes 
from spacetime. - We summarize these observations in the following way. 

In view of gravitational lensing situations, restricting to globally hy- 
perbolic spacetimes means restricting to the case of a transparent 
deflector in a spacetime without causality violation whose topology 
is a product of space and time. 

Since a Cauchy surface may have a complicated topology, the restriction to 
globally hyperbolic spacetimes does not exclude universes with “handles” etc. 

4.1 Criteria for Multiple Imaging 

in Globally Hyperbolic Spacetimes 

In Sect. 3.3 we have formulated several criteria for multiple imaging in arbi- 
trary spacetimes. These results can be considerably strengthened if we restrict 
to globally hyperbolic spacetimes. The reason is that light cones in globally 
hyperbolic spacetimes cannot be too pathological. In particular, the following 
technically important proposition holds true. 

Proposition 13. For a point p in a globally hyperbolic spacetime {Ai, g,'T~^) 
the past light cone of p united with {p} gives a closed subset of Ai. 

Proof Let qn be a sequence in the past light cone of p that converges towards 
a point q yf p in A4. We want to show that q is, again, in the past light cone of 
p. To that end we choose a Cauchy surface S through q. This is possible since, 
by the above-mentioned result of Geroch, every globally hyperbolic spacetime 
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can be foliated into (continuously embedded) Cauchy surfaces. Convergence 
of the qn implies that their pre-image under e~ , using the notation of (10), 
is contained in a compact subset of TpM. So, by passing to a subsequence 
we find a sequence Xn in C~ with e~{Xn) = qn that converges towards a 
vector X in the closure of C~ . Since q p, X is different from zero and, 
thus, lightlike. We do not know yet if X is in the domain of the exponential 
map. Let A„ denote the geodesic with A(j(0) = Xn and A the geodesic with 
A'(0) = X. Since 5 is a Cauchy surface, A„ must intersect 5 in a point 
and A must intersect 5 in a point q. Since geodesics depend continuously on 
their initial conditions, the q„ converge towards q. On the other hand, the 
qn converge towards q. As each geodesic A„ intersects S exactly once, this is 
possible only if q = q which implies that q is in the past light cone of p. □ 

In addition, it can be shown that in the globally hyperbolic case the past light- 
like conjugate locus and the past lightlike cut locus of p are closed subsets of 
A4, see Beem, Ehrlich and Easley [4], Propositions 9.27 and 9.29 in combina- 
tion with our Proposition 13. We have already seen in Example 3 at the end 
of Sect. 3.2 that this is not true without the assumption of global hyperbolic- 
ity. - The following proposition says that for globally hyperbolic spacetimes 
the name “cut point” is, indeed, justified because such a point indicates an 
intersection of geodesics. It was already mentioned that the analogous state- 
ment for complete Riemannian manifolds is known as Poincare Theorem and 
dates back to Poincare [66] and Whitehead [85] . 

Proposition 14. (Poincare Theorem for lightlike geodesics) Let p and q be 
two points in a globally hyperbolic spacetime Assume that q is in 

the past lightlike cut locus but not in the past lightlike conjugate locus of p. 
Then there are at least two past-pointing lightlike geodesics from p to q. The 
past light cone of p has a transverse self -inter section at q. 

A proof can be found in Beem, Ehrlich and Easley [4], Theorem 9.15. The 
light cone must have a transverse self-intersection at q since otherwise the 
two lightlike geodesics would arrive with collinear tangent vectors at q. As 
the assumption of global hyperbolicity excludes the possibility of having a 
closed lightlike geodesic through p, this collinearity would imply that the two 
geodesics are the same. 

If q is in the cut locus and in the conjugate locus, then it may be impossible 
to reach it from p along a second geodesic, even in a globally hyperbolic 
spacetime. This is exemplified by the point q in Fig. 5. However, since a 
conjugate point indicates an intersection with an “infinitesimally neighboring 
geodesic” the name “cut point” might be viewed as justified in this case as 
well. 

Proposition 14 has the consequence that in globally hyperbolic spacetimes 
Proposition 5 admits the following converse. 

Proposition 15. Fix a point p in a globally hyperbolic spacetime {Xi, g,'T~^) 
and assume that the map e~ of (10) is a C°° embedding, i. e., that the past 
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light cone of p is an embedded submanifold of M. Then the past lightlike cut 
locus of p is empty. 

Proof. By contradiction, assume that q is the past cut point of p along some 
lightlike geodesic. If q is also conjugate to p we are done since this proves 
that e~ is not even an immersion. If q is not conjugate to p, Proposition 14 
implies that the past light cone of p has a self-intersection at q, so e~ cannot 
be an embedding. □ 

Together with Propositions 5, 6 and 7 this result implies that in a globally 
hyperbolic spacetime there is a multiple imaging situation for an observer at 
p if and only if the past light cone of p fails to be an embedded submanifold 
ofM. 

Propositions 14 and 15 are not true without the assumption of global 
hyperbolicity. To see this consider the light cone of Fig. 4. Divide a spherical 
section of this cone which is close to p into two hemispheres and excise one of 
them, together with its boundary, from spacetime. Then, if the division has 
been chosen appropriately, the light cone becomes an embedded submanifold 
since one half of the light rays is cut off before the light cone forms a self- 
intersection. However, the cut locus remains unchanged since the remaining 
light rays can be reached by the same timelike curves from p as before. 

Another important feature of globally hyperbolic spacetimes is the fol- 
lowing existence result for lightlike geodesics. 

Proposition 16. Let p be a point and 7 an inextendible timelike curve in a 
globally hyperbolic spacetime {A4, g,'T~^) such that 'jr\I~{p) yf 0. Then there 
is a past-pointing lightlike geodesic A from p to -y that is completely contained 
in the boundary of I~ (j>) . This geodesic does not pass through the past cut 
point of p or through a point conjugate to p before it reaches 7 . 

Proof. Global hyperbolicity implies that 7 cannot be completely contained 
in X~ (j>) because it must reach every Cauchy surface in the future of p. 
Therefore our assumptions imply that 7 intersects in some point q. 

It is well known (see, e. g., Wald [80], Theorem 8.1.6) that every point in 
dl~{p) can be reached from p along a past-pointing lightlike geodesic. This 
geodesic cannot pass through the past cut point of p before it reaches q 
since after passing through the cut point a lightlike geodesic stays inside the 
open set I~ (p) . By Proposition 2 this implies that the geodesic cannot pass 
through a point conjugate to p before reaching q. □ 

This proposition gives, in particular, sufficient conditions for the existence of 
a past-pointing lightlike geodesic from p to 7 that does not contain a point 
conjugate to p. A similar result was proven by Uhlenbeck [78], Corollary 4.8, 
with the help of Morse theory. In Sect. 4.2 we shall comment on her work in 
more detail. 

As a corollary of Proposition 16 we immediately get the following suffi- 
ciency criterion for multiple imaging. As an illustration the reader may use 
Fig. 5 with an appropriately placed worldline 7 . 
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Proposition 17. Let p be a point and 7 : I — > M an inextendible timelike 
curve in a globally hyperbolic spacetime {M, Assume that there is 

a past-pointing lightlike geodesic A from p to 'j that passes through a point 
conjugate to p or through the past cut point of p {or both) before it reaches 
7 . Then there are at least two past-pointing lightlike geodesics from p to 'f. 

Proof. The existence of the lightlike geodesic A implies that 7 intersects the 
closure of I~{p). Since 7 is inextendible, this means that it must intersect 
X~ (p) . But then Proposition 16 gives a past-pointing lightlike geodesic from 
p to 7 which is different from A because the latter contains a conjugate point 
or the past cut point of p. □ 

This proposition says that, under certain assumptions, the existence of con- 
jugate points or cut points along a lightlike geodesic is sufficient for multiple 
imaging. Such a sufficiency criterion was already proven in Proposition 7 for 
arbitrary spacetimes. The new feature of Proposition 17 is that in a glob- 
ally hyperbolic spacetime the worldline 7 can be freely prescribed (except 
for the condition of being inextendible, i. e., “sufficiently long”). In view of 
applications to gravitational lensing, this is a great advantage since a real 
light source such as a galaxy or a quasar cannot be expected to travel along 
a worldline that is constructed as in the proof of Proposition 7. 

4.2 Morse Theory in Globally Hyperbolic Spacetimes 

In Sect. 3.5 we have established a Morse index theory for Fermat’s principle. 
Now we want to discuss the possibility of developing a full-fledged Morse the- 
ory, relating the number of solution curves to the topology of the space of trial 
curves. Whereas the Morse index theory works perfectly well on an arbitrary 
spacetime, the full Morse theory requires a globally hyperbolic spacetime. 
This is in analogy to the case of Riemannian geodesics where the Morse 
index theory works on arbitrary Riemannian manifolds but the full Morse 
theory has to presuppose a complete Riemannian manifold. 

For the geodesic problem on complete Riemannian manifolds, Morse the- 
ory exists in two versions. The first version, invented by Morse [48] in the 
1930s and nicely reviewed by Milnor [46], considers for the space of trial curves 
the finite dimensional manifold of broken geodesics between two points with 
N breakpoints. For the final results one has to consider the limit N ^ 00 . 
The second version, brought forward by Palais and Smale [52] [53] in the 
1960s, considers for the space of trial curves the infinite dimensional Hilbert 
manifold of curves between two points. (For the notion of curves please 
recall Sect. 3.5.) Both versions have been carried over to general relativity. 
For Lorentzian manifolds, the most natural analogue of the geodesic problem 
in complete Riemannian manifolds is the timelike geodesic problem in glob- 
ally hyperbolic spacetimes. A Morse theory for this situation was developed 
independently by Uhlenbeck [78] and Woodhouse [87] who both used finite 
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dimensional approximation techniques in the spirit of Morse and Milnor. An 
attempt to find an infinite dimensional Hilbert manifold version, in the spirit 
of Palais and Smale, was brought forward by Everson and Talbot [13] but, 
unfortunately, turned out to be fatally flawed, see Erratum to [13]. In any 
case, this timelike geodesic problem has no relevance to gravitational lensing 
where we are interested in lightlike geodesics between a point and a timelike 
curve, not in timelike geodesics between two points. 

For our purpose, the relevant variational problem is Fermat’s principle. A 
Morse theory based on a version of Fermat’s principle in globally hyperbolic 
spacetimes was invented by Uhlenbeck [78] who, in analogy to her treatment 
of the timelike geodesic problem in the same paper, used finite dimensional 
approximation techniques. Her results were used by McKenzie [45] to formu- 
late conditions under which in a gravitational lensing situation the number of 
images must be odd. As to an infinite dimensional version of Morse theory for 
Fermat’s principle, the most natural starting point seems to be the formalism 
established in Sect. 3.5. Unfortunately, the parametrization fixing condition 
on the trial curves, i. e., condition (c) of Definition 6, together with the fact 
that the trial curves are of type rather than of type leads to many 
technical problems. For that reason Giannoni, Masiello and Piccione [22] [23] 
used a slightly different Hilbert manifold setting as the starting point. This 
approach led, indeed, to a full Morse theory for lightlike geodesics between 
a point and a timelike curve in a globally hyperbolic spacetime. In the rest 
of this subsection we shall review their main result and discuss some of its 
implications. Since the mathematical details are highly technical we have to 
refer to the original articles [22] and [23] for the proofs. 

It was already mentioned that the general setting for treating variational 
problems in terms of infinite dimensional Hilbert manifolds is due to Palais 
and Smale [52] [53]. In this setting one considers differentiable functions 
F : X — >■ M on a real Hilbert manifold X . In applications to variational 
problems, X is the space of trial curves (or, more generally, trial maps) which 
is typically infinite dimensional, F is the functional to be extremized, and the 
critical points of F (i. e., the points where the differential of F has a zero) are 
the solutions of the variational problem. It is the goal to relate the number of 
critical points of F to the topology of X . More precisely, one wants to relate 
the number Nk of critical points where the Hessian of F has index k to the 
fc-th Betti number Bk of X. Formally, Bk is defined for each topological space 
X in terms of the fc-th singular homology space Hi^{X) with coefficients in 
a field F (The results of Morse theory are true for any choice of F ). For the 
definition of singular homology spaces the reader is refered, e. g., to Bold 
[II], p. 32, or to Spanier [75], p. 173. Hk{X) is a vector space over F and Bk 
is, by definition, the dimension of this vector space. Geometrically, Bg is the 
number of connected components of X and, for fc > I, Bk can be interpreted 
as the number of those “holes” in X that prevent a fc-cycle with coefficients in 
F from being a boundary. In particular, if M. is contractible to a point, then 
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Bk = 0 for all fc > 1. Palais and Smale were able to establish the following 
result, see Corollary (3), p. 338, in Palais [52]. 

Assume that F : X — K is of class at least and satisfies the following 
conditions. 

(1) F is a Morse function, i. e., at each critical point of F the Hessian of F 
is non-degenerate. 

( 2 ) F is bounded from below. 

(3) F satisfies the socalled Condition C, also known as Palais-Smale con- 
dition: There is a complete Riemannian metric h on X such that the 
following holds. If S is any subset of X on which F is bounded and | |(iF| | 
is not bounded away from zero, then there is a critical point of F adherent 
to S. Here || • || denotes the norm induced by the metric h. 

Then the Morse inequalities 



Nk>Bk, fc > 0 



( 22 ) 



and the relation 



(23) 

k=0 fc=0 

hold true. The right-hand side of (23) is, by definition, the Euler characteristic 
X of X. If one introduces the notation N+ = -^ 2 i and N- = 

then (23) takes the form 

N+-N_ = x- (24) 

Please note that the Nk and Bk need not be finite. 

The geometric idea behind this result is the following. It turns out that 
the topology of the sublevel set Xt = {x G X \ F{x) <t} remains unchanged 
if t varies over an interval which does not contain a critical value of F (i. e., 
a value taken by F at some critical point). On intervals containing a critical 
value, the topology of the sublevel set changes by “attaching a handle” for 
each critical point, with the special type of the handle determined by the 
index of the Hessian of F at the critical point. This result was first proven by 
Morse for functions on compact (and thus finite dimensional) manifolds, see 
Milnor [46]. In that case Condition C is automatically satisfied. As a matter 
of fact. Condition C was introduced as a sufficient condition for proving the 
same “handle-body theorem” without the compactness assumption. 

If we want to apply this general result to our variational problem, we have 
to check if the assumptions on F are satisfied by the arrival time functional 
T : Hp .^ — ^ R discussed in Sect. 3.5. The Morse index theorem tells us that 
T is a Morse function if and only if 7 does not intersect the caustic of the 
past light cone of p. By Proposition 10, this is the case for almost all p once 
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7 has been chosen. Moreover, the Morse index theorem tells us that Nk is 
the number of past-ponting lightlike geodesics from p to 7 that pass through 
k conjugate points before arriving at 7 , counting each conjugate point with 
multiplicity. The second condition of F being bounded from below is easily 
checked to be true on globally hyperbolic spacetimes. The main problem 
comes with the third condition, i. e., with Condition C. Giannoni, Masiello 
and Piccione [22] [23] found it necessary to modify the whole setting a little 
bit before they were able to verify Condition C. First they considered trial 
curves, rather than F[^ trial curves, i. e., curves which are differentiable almost 
everywhere and whose derivative is locally square integrable. Unfortunately, 
the equation p(A', A') = 0 (almost everywhere) does not define a submanifold 
of ([0, 1], Ad), contrary to the case. Therefore Giannoni, Masiello and 
Piccione replaced this with the equation g(A', A') = —e^ which, for fixed 
e: > 0 , defines a submanifold and considered the limit e —>■ 0 afterwards. 
Second, they dropped the parametrization condition (c) of Definition 6 . This 
has the effect that every critical point of the arrival time functional now 
comes together with all its (ff^) reparametrizations, i. e., r cannot be a 
Morse function on this modified space of trial curves. Therefore Giannoni, 
Masiello and Piccione switched to a new functional Q that is related to the 
arrival time functional in a similar fashion as the energy functional F(a) = 

fg /i(a'(s), a'(s)) ds to the length functional £(a) = ^ h(^a'{s),a'{s)) ds 

in Riemannian geometry. In this modified setting Giannoni, Masiello and 
Piccione were, indeed, able to verify Gondition G, thereby establishing a full 
Morse theory for the variational problem at hand. Their main result can be 
phrased in the following way. 

Theorem 4. (Morse relations for lightlike geodesics) Let {A4, g,T'^) be a 
globally hyperbolic spacetime, fix a point p £ M. and a past-pointing timelike 
C°° curve 7 : I — > M from an open interval I into M. such that p ^ 7 . 
Assume that 7 is closed in M. and does not intersect the caustic of the past 
light cone of p. Let denote the topological subspace of {[0,1], A4) 

consisting of all A € i?^([0, l],Ad) with A(0) = p, A(l) G 7 , g{X , X') = 0 and 
X' past-pointing almost everywhere. Let be the k-th Betti number of 
and Nk the number of past-pointing lightlike geodesics from p to ^ that pass 
through k conjugate points before reaching 7 , counting each conjugate point 
with multiplicity. Then the Morse relations (22) and (23) hold true. 

This result is implied by Theorem 1.7 of Giannoni, Masiello and Piccione 
[23]. Actually, they prove a slightly more general result since they consider 
curves confined to a subset A of Ai with certain properties. Our Theorem 4 
is the version for A = Ad in which case the assumptions placed by Giannoni, 
Masiello and Piccione upon the functional are satisfied, for each pair (p, 7 ), 
if and only if the spacetime is globally hyperbolic. 

The Morse relations have several interesting implications. Information on 
the Nk, i. e., on the number of images in gravitational lensing situations. 
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place restrictions upon the Betti numbers and the other way round. If we 
want to make full use of such results we need, of course, some methods of 
determining the topology of the curve space ^ which is a difficult task in 
general. Before commenting on this problem we list some consequences of the 
Morse relations. Under the assumptions of Theorem 4, the following is true. 

(a) The Morse inequality for fc = 0, i. e. Ng > Bg, implies that there are 
at least Bg past-pointing lightlike geodesics from p to 7 which are free of 
conjugate points, where Bg is the number of connected components of 
This strengthens Proposition 16. 

(b) If Hp.y is non-empty (i. e., jr\I~{p) ^ 0) and not contractible, we 
have Bg > 1 and B^ > 1 for some fc > 1. Then there is a multiple imaging 
situation, Ng + Nk > 2 , and at least one past-pointing lightlike geodesic from 
p to 7 must contain a conjugate point. 

(c) If the number of past-ponting lightlike geodesics from p to 7 is finite, then 
all the Betti numbers Bk must be finite. 

(d) If we write the Morse relation (23) in the form of (24), we find iV_|_-|-fV_ = 
27V_ +x ■ Thus, in a gravitational lensing situation with finitely many images 
the total number of images is odd if and only if the Euler characteristic x is 
odd. 

It is an interesting problem to determine all globally hyperbolic spacetimes 
in which gravitational lensing always leads to an odd number of images. 
(Please recall that the assumption of global hyperbolicity implicitly restricts 
to transparent deflectors.) With the help of Morse theory we have reduced 
this to the problem of determining the Euler characteristic of the curve space 
Hp for each pair (p, 7 ) that satisfies the assumptions of Theorem 4. In some 
special cases, this can be achieved in the following way, please cf. McKenzie 
[45] for a similar investigation. 

Let us assume that the assumptions of Theorem 4 are satisfied and choose 
a timelike C°° vector field U on A4 such that V is tangent to 7 . This is 
possible, see Proposition 5.1 in Giannoni, Masiello and Piccione [22]. Then 
the projection (21) defines a map A 1 — > A = Try o A from Bp^ to the space 

Hp-y. = {a € 7L^([0, l],5y) I A(0) = 7Tv(p), A(l) = 7 ry( 7 ) }. This map is 
obviously continuous. Moreover, it is injective since a past-pointing lightlike 
curve is uniquely determined by its initial point and by its spatial projection. 
In general, however, it need not be surjective because for some curves in 
the target space the lightlike lift may terminate (at the “boundary” of A4) 
before 7 has been reached. This certainly happens whenever there is a particle 
horizon in the sense that for some point q G T~(p) there is no past-pointing 
causal curve from q to 7 . As a simple example where particle horizons occur 
one may consider Minkowski space restricted to the region t > 0 . Let us 
say that the lightlike lifting property is satisfied for the pair (p, 7 ) if the 
map A I — > A = Try o A gives a homeomorphism from Hp .y onto Hp,y. As 
C°° curves are dense in the set of curves and the lifting procedure is 
obviously iL^-continuous, the lightlike lifting property is satisfied if every C°° 
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curve in is the projection of a curve A in An analytical condition 

that guarantees the lightlike lifting property is the socalled metric growth 
condition of Uhlenbeck [78] which was employed by McKenzie [45]. We can 
now prove the following result. 

Theorem 5. (Odd number theorem) Assume that all the assumptions of 
Theorem 4 are satisfied and let V be a timelike C°° vector field on A4 such 
that V is tangent to 7 . Moreover, assume that the lightlike lifting property is 
satisfied for {p, 7 ) and that the space Sy is contractible. Then the number of 
past-pointing lightlike geodesics from p to 7 is (infinite or) odd. 

Proof. If Sv is contractible, the curve space Hp.^ is contractible. To prove this 
one fixes a particular C°° curve Ao G ^ and considers a differentiable map 

4> : [0, 1] X [0, 1] X Sv — Sv such that 0(s,O,a;) = x and 4>{s, l,a;) = Ao(s) 
for all s G [ 0 , 1 ] and x G Sy. The existence of such a map is guaranteed 
since Sy is contractible. (It is true that contractibility is defined in terms of 
homotopies of continuous maps. However, according to a well-known theorem 
every continuous map between two manifolds is homotopic to a map, see, 
e. g., Bott and Tu [7], Proposition 17.8, p. 213.) Now the desired contraction 
<P : [0,1] X — >■ is defined by <P(t,X)(s) = </>(s, t, A(s)). Since the 

lightlike lifting property is satisfied, this implies that Hp.^ is contractible, 
i. e., the Morse relations hold with Bo = 1 and Bk = 0 ioi k > 0 which 
implies y = 1 . If we write the Morse relation (23) in the form of (24), we 
find iV+ -I- 7V_ = 2iV_ -I- 1 , i. e., N- is (infinite or) odd. □ 

This theorem can be phrased as saying that a transparent deflector produces 
an odd number of images provided that there are no particle horizons and the 
spatial topology is trivial. It is, of course, true that particle horizons do occur 
in many cosmological models which are of physical interest. So, in a sense, the 
lightlike lifting property may be considered as a reasonable assumption only 
in gravitational lensing situations where cosmological aspects can be ignored. 

An obvious example where the lightlike lifting property is satisfied is the 
case that P is a complete and hypersurface-orthogonal conformal Killing 
vector field. In that case Fermat’s principle reduces to the geodesic problem 
for a Riemannian metric h on Sy, as outlined at the end of Sect. 3.4, and 
global hyperbolicity is easily checked to be equivalent to completeness of the 
Riemannian manifold (5v, h). Hence, the Morse theory for Fermat’s principle 
reduces to the standard Morse theory for Riemannian geodesics. 

It is an open problem to determine the Euler characteristic of the curve 
space Hp .^ in cases where particle horizons do occur, i. e., where the lightlike 
lifting property is violated. Apparently no results in this direction exist so- 
far. The socalled “chronological homotopy theory” employed by Woodhouse 
[87] might be of help in this connection. The latter is closely related to the 
Lorentzian fundamental groups of Smith [74] and to the notion of “future 
one-connectedness” of Flaherty [15] [16] which is also discussed in Beem, 
Ehrlich and Easley [4]. 
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The formalism presented in this subsection has a (much simpler) analogue 
in the quasi-Newtonian approximation of gravitational lensing. A Morse the- 
ory for the latter situation was developed by Fetters [61] [63]. In that case 
the space of trial curves is genuinely finite dimensional, so one can apply 
the techniques of Morse without having to consider a limit — >■ oo. In par- 

ticular, Fetters [61] used this formalism to prove an odd number theorem. 
Already earlier, it was shown by Burke [9] that in the quasi-Newtonian ap- 
proximation every transparent deflector produces an odd number of images. 
Burke used a fairly simple argument from differential topology, rather than 
Morse theory, cf. Schneider, Ehlers and Falco [70], p. 172, and Lombardi [42]. 
A similar argument will be used in the next section to prove an odd num- 
ber theorem, without invoking Morse theory, for asymptotically simple and 
empty spacetimes, see Theorem 6 below. 

It should be mentioned that, actually, there are several gravitational lens 
candidates where an even number of images is observed. Usually astronomers 
are not troubled by this fact because they found good reasons to assume that 
in those cases one of the images is too faint to be seen. Also, it might be 
possible that one image is hidden behind the deflector, or that two images 
are so close together that they are mistaken for being just one image. 

5 Gravitational Lensing in Asymptotically Simple 
and Empty Spacetimes 

In elementary optics one often considers “light sources at infinity” which are 
characterized by the fact that all light rays emitted from such a source are 
parallel to each other. In this section we want to introduce the notion of “light 
sources at infinity” for general-relativistic spacetimes. To that end we have 
to restrict to a special class of spacetimes called “asymptotically simple and 
empty” . Roughly speaking, an asymptotically simple spacetime is a spacetime 
for which the notion of “(future- or past-pointing) light rays going out to 
infinity” makes sense. The following definition, which is essentially due to 
Fenrose [54], puts this vague idea into precise form, cf., e. g. Hawking and 
Ellis [29], p. 222. 

Definition 8. A spacetime {Ai,g, T~^) is called asymptotically simple if there 
is a strongly causal spacetime {A4,g,T'^) with the following properties. 

(a) At is an open submanifold of M with a non-empty boundary 8A4 . 

(b) There is a C°° function I? : M — >• R such that M = {p € M\fi{p) > 0}, 
dM = {p € M\f2{p) = 0}, and the equation g = g holds on M . 

(c) Every inextendible lightlike geodesic in Ai has two endpoints on dAi . 
[Ai, g,T~^) is called asymptotically simple and empty if, in addition, 

(d) there is a neighborhood U of dAi in A4 such that the Ricci tensor of g 
vanishes onU C\Ai. 
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Asymptotically simple and empty spacetimes are good models for isolated 
gravitating bodies. Condition (d) of Definition 8 is a way of saying that, suf- 
ficiently far away from the gravitating body under consideration, Einstein’s 
vacuum field equation is satisfied. This is a reasonable model for a deflec- 
tor producing gravitational lensing as long as cosmological aspects can be 
ignored. 

Conditions (b) and (c) of Definition 8 imply that in an asymptotically 
simple spacetime all lightlike geodesics are complete. Indeed, since on j\4 
the equation g = is supposed to hold, every lightlike g-geodesic be- 
comes a lightlike g-geodesic by changing the affine parameter according to 
ds/ds = 17“^. As Q is zero on dM., this implies that s — ?► ± oo if the geodesic 
approaches . Hence, it is justified to interpret the elements of 9A1 as 
points at infinity or, more precisely, as those points at infinity which can be 
reached along light rays. Thus, our plan to consider “light sources at infinity” 
naturally leads to considering “worldlines” contained in (3A1. 

In view of gravitational lensing, the observation that in an asymptotically 
simple and empty spacetime all lightlike geodesics are complete has the fol- 
lowing interesting consequence. By a well-known theorem (see Hawking and 
Ellis [29], Proposition 4.4.5) a complete lightlike geodesic must contain a pair 
of conjugate points if the weak energy condition and the socalled “generic 
condition” are satisfied along this geodesic. By Proposition 7, the occurence 
of conjugate points gives rise to multiple imaging. This result may be inter- 
preted as saying that in almost all physically reasonable spacetimes which 
are asymptotically simple and empty multiple imaging takes place. 

Before turning our attention to “light sources at infinity” we have to 
recall some basic facts about asymptotically simple and empty spacetimes. 
First we use part (b) of Definition 8 to define a vector field Z on A4 by the 
equation dD = g(Z, • ). It is well-known that condition (d) of Definition 8 
implies that Z is non- vanishing and g- lightlike at each point of dM. For a 
proof we refer to Hawking and Ellis [29], p. 222. (Please note that Hawking 
and Ellis include the assumption of dQ having no zeros on dM into the 
definition of asymptotically simple spacetimes. However, it is a well-known 
result of Penrose [54] that, with the additional assumption (d) of asymptotical 
emptiness, this property must be automatically satisfied.) As a consequence, 
dM is a g-lightlike hypersurface of M, ruled by the integral curves of Z which 
are g-lightlike geodesics (up to parametrization) . Those lightlike geodesics are 
called the generators of dM. 

In combination with assumption (c) of Definition 8, the property of dM 
being a g-lightlike hypersurface implies that dM has two connected com- 
ponents: (pronounced “scri plus”) where future-pointing g-lightlike geo- 

desics terminate and (pronounced “scri minus”) where past-pointing g- 
lightlike geodesics terminate. - We now state an important proposition, es- 
sentially due to Geroch [21], which determines the global structure of asymp- 
totically simple and empty spacetimes. 
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Proposition 18. Let {Ai, g,T'^) he an asymptotically simple and empty 
spacetime. Then {Ai^g,T^) is globally hyperbolic and every Cauchy surface 
is homeomorphic to Either component of dM can he diffeomorphi- 
cally mapped onto S'^ xM. in such a way that each generator of is mapped 
onto an W-line. Here denotes the 2-dimensional sphere. 

If the reader wants to verify the proof of this proposition he or she should 
consult Newman and Clarke [49] who clarified a subtlety overlooked in the 
original work of Geroch [21] and in Hawking and Ellis [29], Proposition 6.4.9.. 

After these preparations we are now ready to discuss gravitational leas- 
ing situations with light sources at infinity. To that end we consider, in an 
asymptotically simple and empty spacetime, a sequence of timelike C°° curves 
7 „ : I — > A4 that approach, for n —> oo, a curve 7 : I — > J~ . We want 
to assume that 7 is an immersed curve of class at least, and that the 
limit is in the sense, i. e., that not only lim 7 n(s) = 7 ( 3 ) in At but 

n—^oo 

also lim 7 ^(s) = 7 '(s) in TA4. Since Y„(s) is g-timelike and thus g-timelike, 

n— >-oo 

7 '(s) is either 5 -timelike or g-lightlike. The first case is impossible, since JT~ 
is a lightlike hypersurface with respect to g, and the second case is possible 
only if 7 '(s) is tangent to a generator of JL~. We are thus led to the following 
conclusion. In an asymptotically simple and empty spacetime, the worldline 
of a light source at infinity is to be identified with (a section of) a generator 
of J-. 

Please note that this does not mean that light sources at infinity move at 
the speed of light. The (physical) metric g is not defined on dAt, i. e., it does 
not make sense to speak of the causal character of a curve 7 : I — JT~ with 
respect to g. The (unphysical) metric g is but a formal device to introduce a 
geometric structure on the set of points at infinity. The causal character of 
curves in dAl with respect to g has no direct physical interpretation. 

Henceforth we restrict to light sources at infinity with inextendible world- 
lines, i. e., to (maximal) generators of J~ . From Proposition 18 we know that 
the set of generators of J~ is a manifold diffeomorphic to the 2-sphere 5^. 
Hence, the set of all light sources at infinity is in one-to-one correspondence 
with the points of 5^. On the other hand, we can consider for any p £ At the 
set of all one-dimensional g-lightlike subspaces of TpAt. This, again, gives a 
manifold diffeomorphic to which may be called the sky at p. Clearly, each 
point of this manifold determines a 5 -lightlike past-pointing geodesic through 
p uniquely up to parametrization (i. e., it determines a light ray arriving at p), 
and vice versa. Hence, the points of this manifold can, indeed, be identified 
with the points at the celestial sphere of an observer at p. This construction 
defines for each p € Ai a C°° map 

fp-.S^^ 52 (25) 

by assigning to each point x of the sky at p a light source fp{x) at infinity by 
extending the lightlike geodesic tangent to x until it reaches ~ 5^ x M 
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and projecting onto the first factor afterwards. Henceforth we refer to this 
map fp as to the lens map at p for light sources at infinity. The lens map can 
be written in an obvious way with the help of the exponential map. Based 
on the same idea, one may try to establish a similar lens map in arbitrary 
spacetimes. The problem is that in the general situation there is no natural 
“source sphere”, i. e., no analogue of the sphere at infinity. Nonetheless, a 
kind of general lens map can be established, as was recently demonstrated by 
Frittelli and Newman [19]. Their formalism, which is based on the Hamilton- 
Jacobi equation for families of light rays, will not be used in this article. 




Fig. 6. In this picture the smaller sphere is meant to represent the sky at p and 
the bigger sphere is meant to represent light sources at infinity. The lens map (25) 
assigns to each point x of the sky at p a light source fp{x) at infinity. If fp is not 
one-to-one, then there is multiple imaging for light sources at infinity. 



The lens map (25) obviously gives all informations on how light sources 
at infinity are seen by an observer at p. For each light source at infinity, 
represented by a point y G S'^, the set fp^{y) gives all points at the sky 
of p where this light source is seen, see Fig. 6. If fp^{y) consists of more 
than one point, then there is multiple imaging. Please recall that y G is 
called a regular value of fp if for all y G with fp{x) = y the differential 
Txfp : TxS'^ — > TyS'^ has maximal rank (i. e., is surjective). It is easy to 
check that p is a regular value of the lens map if and only if the generator 
represented by y does not insersect the caustic of the past light cone of p. 
Here and in the rest of this subsection the term “light cone” always refers to 
the light cone in M with respect to the metric g since we need the extension 
of the “physical” light cone to . Owing to the well-known Theorem of Sard 
(see, e. g., Guillemin and Pollack [26], p. 39, or Hirsch [31], p. 69) almost all 
points y G S'^ are regular values of the lens map. This is in agreement with 
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Proposition 10 according to which the situation that a light source passes 
through a caustic point is to be viewed as “exceptional”. By compactness 
of S^, fp^iy) is finite for any regular value y. Hence, the observer at p sees 
finitely many images of each light source at infinity that does not pass through 
the caustic of the past light cone of p. 

We shall now establish the remarkable result that, for each light source 
at infinity which does not pass through the caustic of the observer’s past 
light cone, the number of images is odd. Based on this observation we shall 
then prove that the same result is true for a light source moving inside M 
provided that its worldline is inextendible and does not approach This 
odd number theorem for light sources in asymptotically simple and empty 
spacetimes will emerge as an application of elementary differential topology; 
in particular, it will not be necessary to invoke the considerable technical 
apparatus of Morse theory that was discussed in Sect. 4.2 above. 

The proof of our odd number theorem is based on a simple idea. How- 
ever, the details of the proof look a little bit involved since it is necessary 
to introduce some cumbersome notation. For that reason we first give the 
general idea upon which the proof is based. This general idea of an odd- 
number argument is popular with astronomers who usually present it on the 
understanding that space and time can be described in a Newtonian fashion. 
Then the argument goes like this (cf. Schneider, Ehlers and Falco [70], p. 
176). Consider all light rays that reach at a particular instant an observer at 
the point p. Parametrize these light rays with time, in a past-pointing way 
such that they have t = 0 on their arrival at p. Then for all t > 0 we can 
consider the wavefront Wt, defined as the set of all points in space that are 
crossed by at least one of the considered light rays at the time t. For small 
t, Wt is a sphere around p. For larger t, Wt will develop self-intersections 
because the light rays are influenced by gravitating masses in the universe. 
Whenever the wavefront crosses a light source, given by a point moving in 
space in dependence of time, this gives rise to an image of the light source 
seen at p. As the wavefronts develop as continuous deformations of a sphere, 
the following definition makes sense. We say that the light source is outside of 
the wavefront Wt if the position of the light source at t can be connected to p 
by a curve that intersects Wt an odd number of times and inside otherwise. 
Here we have to restrict to curves that intersect the wavefront transversely 
and only at points where the wavefront is an immersed submanifold, i. e., not 
at caustic points. Now we assume that the light source itself never touches a 
wavefront with a tangent velocity vector and that it stays away from caustic 
points. Then, whenever the light source meets a wavefront, it changes from 
outside to inside or vice versa. For small t the light source is outside. For 
large t it is inside, provided that all light rays go out to infinity and the light 
source does not go out to infinity. This implies that, in total, the wavefront 
crosses the light source an odd number of times, i. e., that there is an odd 
number of images. 
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Instead of the somewhat vague notion of being inside or outside one may 
use he socalled “mapping degree” from differential topology. Using this ter- 
minology the above argument was written down, apparently for the first time, 
in the introduction of McKenzie [45] . Gottlieb [25] tried to translate this gen- 
eral idea into a Lorentzian manifold setting and came to the conclusion that 
the argument does not work without quite special and unrealistic assump- 
tions. However, this conclusion is largely based on the fact that Gottlieb 
implicitly restricts to multiple imaging situations without time delay. What 
we want to show in the following is that, with the help of the mapping de- 
gree, the above-mentioned odd number argument can be made into a precise 
theorem in asymptotically simple and empty spacetimes. The crucial point 
is, of course, that in this case it is guaranteed that all light rays go out to 
infinity. As an aside, we mention that a slightly different mapping degree 
argument was used by Lombardi [42] to prove an odd number theorem in the 
quasi-Newtonian approximation formalism. This is closely related to Burke’s 
[9] original proof of an odd number theorem, again in the quasi-Newtonian 
approximation, using the index of vector fields. The latter is also discussed 
in Schneider Ehlers and Falco [70]. 

The essential tool for our proof is the mapping degree of a C ^ map 
F ■. Ml — > M 2 between oriented manifolds of the same dimension. In the 
following we briefly summarize the basic facts about this notion. For more 
background material the reader is refered to Westenholz [83] , Guillemin and 
Pollack [26] and Bold [11] who, in this order, treat the subject with an in- 
creasing amount of abstract mathematics. First we recall that, by the Sard 
theorem already mentioned above, almost all points y € M 2 are regular val- 
ues of F. If, for such a regular value y, the pre-image F~^{y) is contained in 
a compact set and thus finite, the local degree of F at y can be defined by 
the equation 



where sgn(a:) is, by definition, equal to -1-1 if the tangent map T^;F preserves 
orientation and equal to —1 if T^F reverses orientation. One can then estab- 
lish the following facts. 

(a) If Ml and M 2 are both compact without boundary, degy(F) is the 
same for all regular values y. In this case one calls deg(F) = degy{F) simply 
the degree of F. The degree is a homotopic invariant in the sense that a 
second map F : Mi — > M 2 has the same degree as F if and only if F 
can be continuously deformed into F. 

(b) If Ml and M 2 are compact with smooth boundaries, and if F restricts 
to a map OF : dMi — >■ dM 2 , then 



degy(F) = sgn(x) . 



(26) 



X e F-^{y) 



degy{F) = deg{dF) 



(27) 



for all regular values y G M 2 \ dM 2 - 
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We are now ready to state and prove the desired odd number theorem. 

Theorem 6. For any point p in an asymptotically simple and empty space- 
time the following holds true. 

(a) The lens map (25) has degree one, deg(/p) = 1. As a consequence, each 
generator of J~ that does not pass through the caustic of the past light cone 
of p can be reached from p along an odd number of lightlike geodesics. (We 
have already seen above that this number is finite.) 

(b) Let I be an open interval and 7 : I — > A 4 a timelike embedded C°° curve 
such that 7 is closed in M. and has no endpoint on J~ . ( This is a way of 
saying that 7 is inextendible in A 4 and, in the past direction, does not go out 
to infinity approaching the velocity of light.) If 7 does not pass through the 
caustic of the past light cone of p, then the number of past-pointing lightlike 
geodesics from p to ^ is finite and odd. 

Proof. Fix a curve 7 that satisfies the assumptions of (b). We want to con- 
struct a timelike vector field V on A 4 that is tangent to 7 and smoothly 
extends to the vector field Z on J~ given by g(Z, •) = dL 2 . First we choose a 
future-pointing timelike C°° vector field V\ on some neighborhood of 7 in 
M. such that Vi is tangent to 7. This is possible since 7 is an embedding and 
7 is closed in j\ 4 ; the proof can be patterned after the proof of Proposition 
5.1 in Giannoni, Masiello and Piccione [22]. Then we choose a future-pointing 
timelike vector field V2 on an open subset IA2 oi M. whose closure in AA 
covers J~ such that V2 smoothly extends to Z on J~ . The existence of such 
a vector field V2 follows from the fact that J~ is a closed embedded 5-light- 
like submanifold of }A. Since 7 does not approach J~ , the domains of Vi 
and V2 can be chosen disjoint. Finally, we choose a future-pointing timelike 
vector field V3 on an open subset U3 oi M. such that U\,U2 and U3 
cover M. and the closure 0HA3 in AA has void intersection with 7 and with 
fl~. Then we get the desired vector field V by combining Vi, V2 and V3 
with a partition of unity. For this vector field V , we consider the projection 
(21) onto the 3-manifold of integral curves of V . By Proposition 18, Sy is 
homeomorphic to Since, by the theorem of Moise [47] already mentioned 
above, any 3-dimensional topological manifold admits a unique differentiable 
strcuture, Sy must even be diffeomorphic to or, what is the same, to the 
open unit ball B = |a;| < l}. Since V extends to the vector field Z 

on J'~ that is tangent to the generators, wy extends to a map 

Tty.AAAj- — >B (28) 

between manifolds with boundaries. - Now the vector field V defines a time- 
like vector Vp at the point p. We choose three spacelike tangent vectors 
Ei,E2,E3 at p with g(Ep,Vp) = 0 and g[Ep,Ey) = -g{Vp,Vp) 5 ^^ ■ Then 
each a; G defines a past-pointing lightlike 5-geodesic by initial condi- 
tions Ax(0) = p and A(,(0) = x^Ei -\- x'^E2 -\- x^E^ — \x\Vp. Since we use the 
5-affine parametrization, rather than the 5-affine parametrization, A^ arrives 
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at J~ at a finite parameter value v{x) G M. Having established the projec- 
tion (28), the map x i — > Xx and the map x i — > v{x), we are now ready to 
prove part (a) of the proposition. For each x G S'^, the curve ny o in B 
starts at the origin and reaches the boundary at some parameter value v{x). 
Hence, for each r G ]0,1], there is a unique parameter value u(r,x) < v(x) 
such that [ify (Aa;(s)) I is smaller than r for 0 < s < u{r,x) and equal to 
r for s = u{r,x). Then the assignment x i — >■ ^r{x) = i7fv(Ax(w(r, x))) 
gives a C°° map ■ S'^ — 5^. For r = 1 we get the lens map (25), 
^1 = fp- For r sufficiently small, is an orientation preserving diffeomor- 
phism, hence deg(^r-) = 1- This follows from the fact that the light cone 
looks like the Minkowski light cone if we restrict to sufficiently short light 
rays. Since the degree is a homotopic invariant, letting r vary from some 
small value up to the value 1 shows that the lens map has degree one. Now 
each generator of is the pre-image of a point y G = dB under the 
map 7fy. This point y is a regular value of the lens map if and only if the 
generator does not meet the caustic of the past light cone of p. Under this 
condition is finite, by compactness of 5^. Let us denote by n± the 

number of points x G fp^{y) such that sgn(a;) = ±1. Then the definition 
of the degree implies n+ — n_ = deg(/p). Since deg(/p) = 1, this gives 
n+ -|- n_ = 2n_ -|- 1, i. e., the number of points in fp^{y) is odd. - Now we 
prove part (b). To that end we consider the map F : B — > B defined by 
F{x) = Wy(^Xx{v{x) |a;|)). Clearly, the restriction of this map to the bound- 
ary gives the lens map dF = fp : dB = — >■ dB = S'^. If our curve 7 

does not meet the caustic of the past light cone of p, the point y G B with 
^v^{y) = 7 is a regular value of F. Hence, by (27), degy{F) = deg(/p) = 1. 
By compactness of B, the set F~^{y) is finite. As in the proof of part (a), we 
get n^ + ri- = 2n_-|-l, where n± denotes the number of elements x G F~^{y) 
with sgn(a;) = ±1. Thus, the number of elements in F~^{y) is odd. □ 

The fact that the lens map has degree one implies, in particular, that the lens 
map is surjective which was not obvious from the start. The reader should 
also consult Kozameh, Lambert! and Reula [38] who state in Lemma 1 a 
result which is essentially equivalent to the fact that, in our terminology, the 
lens map has degree one. This paper [38] belongs to a long series of articles 
by Ted Newman, Carlos Kozameh and various coauthors on studying general 
relativity in terms of the Hamilton- Jacobi equation for families of lightlike 
geodesics. For reviews on this topic we refer to Chap. 7 of Joshi [33] and to 
Kozameh [37]. In particular, these authors have found interesting results on 
the geometry of “light cone cuts at infinity”, i. e., of intersections of light 
cones with or in an asymptotically simple and empty spacetime, 
which are of relevance in view of gravitational lensing. 

As stressed already at the beginning of this subsection, an asymptotically 
simple and empty spacetime is a good model for an isolated gravitating body, 
i. e., if cosmological aspects are ignored. Rudiments of cosmology can be 
introduced by modifying condition (d) of Definition 8. E. g., one could require 
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Ric = Ag near dAi with a positive or negative cosmological constant A, 
rather than the vacuum field equation Ric = 0 . The resulting spacetimes are 
called asymptotically deSitter for R > 0 and asymptotically anti-deSitter for 
d < 0. It was verified already by Penrose [54] that then dA4 is no longer 
5 -lightlike but rather g-spacelike for d > 0 and g-timelike for d < 0. In 
the latter case we can consider immersed worldlines in dM. which are g- 
timelike. For such “light sources at infinity” in an asymptotically anti-deSitter 
spacetime we have Fermat’s principle in the version of Theorem 1, viewed in 
the spacetime (dd,g,T+), at our disposal. This observation was used by 
Woolgar [88] to prove a positive energy theorem for asymptotically anti- 
deSitter spacetimes. 

The class of asymptotically simple spacetimes seems particularly appro- 
priate for discussing gravitational lensing in a Lorentzian geometry setting. 
As only a few results in this direction have been worked out so far, all dedi- 
cated experts are invited to join the work in this interesting field. 
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